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(54) Title: A METHOD OF IMPROVING THE PRODUCTION OF BIOMASS OR A DESIRED PRODUCT FROM A CELL 
(57) Abstract 

The production of biomass or a desired product from a cell can be improved by inducing conversion of ATP to A DP without primary 
effects on other cellular metabolites or functions which is achieved by expressing an uncoupled ATPase activity in said cell and incubating 
the cell with a suitable substrate to produce said biomass or product. This is conveniently done by expressing in said cell the soluble 
part (F|) of the membrane bound (FoFi type) H*- ATPase or a portion of F| exhibiting ATPase activity. The organism from which the F| 
ATPase or portions thereof is derived, or in which the Fi ATPase or portions thereof is expressed, may be selected from prokaryotes and 
eukaryotes. In particular the DNA encoding Fi or a portion thereof may be derived from bacteria and eukaryotic microorganisms such as 
yeasts, other fungi and cell lines of higher organisms and be selected from the group consisting of the gene encoding the Fi subunit 0 or 
a portion thereof and various combinations of said gene or portion with the genes encoding the other Fi subunits or portions thereof. The 
method can be used i.a. for optimizing the formation of biomass or a desired product by a cell by expressing different levels of uncoupled 
ATPase activity in the cell, incubating the cell on a suitable substrate, measuring the conversion rate of substrate into biomass or the desired 
product at each level of ATPase expression, and choosing a level of ATPase expression at which the conversion rate is optimized. 
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A method of improving the production of biomass or a de- 
sired product from a cell 



This invention relates to a method of improving the pro- 
5 duction of biomass or a desired product from a cell by 
inducing conversion of ATP to ADP without primary effects 
on other cellular metabolites or functions. The invention 
also relates to a method of optimizing the production of 
biomass or a desired product from a cell utilizing this 
10 first method. The desired product may for example be lac- 
tic acid produced by lactic acid bacteria and ethanol or 
carbondioxide produced by yeast. 

BACKGROUND OF THE INVENTION 

15 

A wide range of microorganisms are used for the produc- 
tion of various organic compounds and heterologous pro- 
teins. One example hereof is the production of lactic 
acid and other organic compounds by the lactic acid group 
20 of bacteria, which results in the acidification and fla- 
vouring of dairy products, better known as cheese and 
yougurt production . 

From the microorganism's point of view, the organic com- 
25 pounds which are excreted from the cells are often merely 
the by-product of a process that is vital to the cells: 
the production of various forms of free energy (ATP, 
NAD ( P ) H , membrane potential , etc.)- Therefore, although 
many of the microorganisms which are being employed in 
30 these processes are reasonably well suited for the pur- 
pose, there is still a great potential for optimizing the 
productivity of these organisms when, looking from the 
bioreactor point of vue . Likewise, the production of het- 
erologous proteins by a microorganism is not what the or- 
35 ganism was adapted for and also here there is a potential 
for optimization. 
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Often when microorganisms are engineered for the purpose 
of optimizing an industrial production process, the reac- 
tions leading to the desired product will affect the 
delicate balance of co-factors involved in the energy me- 
tabolism of the cell. For instance if the glycolytic re- 
actions producing lactate from sugar were somehow to be 
enhanced (e.g. by overexpressing .the glycolytic enzymes) 
this would automatically lead to the convertion of ADP to 
ATP. The ratio between the concentrations of ATP and ADP 
is usually quite high in the growing cell ([ATP] /[ADP] > 
10), and when the ratio [ATP] /[ADP] changes, the sum of 
[ATP] and [ADP] still remains virtually constant. There- 
fore, if in the example above, the enhanced production of 
ATP changes the [ATP]/[ADP] ratio from 10 to say 30, this 
will only marginally affect the concentration of ATP. The 
ADP concentration however will change by a factor of 
three. The cells will then hardly feel the surplus of ATP 
but the ADP pool in the cells may be depleted to such an 
extent that reactions in which ADP is a co-factor or al- 
losteric regulator will be suppressed by the lack of ADP. 
The result may be that the total flux through the pathway 
(here through glycolysis) is only marginally increased. 
In the future, this situation is likely to occur more 
frequently, as the productivity of bioreactors are opti- 
mized by other means, and in these cases, it will be even 
more important (compared to the normal cell) to regener- 
ate the ADP from ATP, in order to further increase the 
productivity. 

Previously, attempts have been made to decrease the in- 
tracellular ATP concentration in yeast, employing sets of 
reactions which together form futile cycles, see EP pat- 
ent No. 245 481. Often, the first reaction of a futile 
cycle is part of the regular metabolic network of the 
cell, for instance the phosphorylation of a glycolytic 
intermediate, coupled to the utilisation of ATP. The sec- 
ond reaction, which may also sometimes be part of the 
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metabolic network, then de-phosphorylates the glycolytic 
intermediate without regenerating the ATP that was con- 
sumed in the first process, the overall effect being that 
a high energy phosphate bond is consumed. The limited 
5 'success that this strategy has had so far, is probably 
due to the fact that it is impossible to obtain a sig- 
nificant futile flux without decreasing the concentration 
of the phosphorylated intermediate, thereby , disturbing 
the cellular function and ultimately the growth. In addi- 
10 tion, when the approach is to decrease the concentration 
of a glycolytic intermediate, this will effectively re- 
move the substrate for the remaining part of the glycoly- 
sis, which will often result in a decreased flux through 
this pathway, rather than the desired increased flux. 

15 

Other strategies have been to use chemicals such as dini- 
trophosphate to stimulate the activity of the plasma mem- 
brane H + -ATPase by the addition of uncouplers of the mem- 
brane potential, or to genetically express the enzyme 

20 acid phosphatase in the cytoplasm, an enzyme that will 
remove phosphate groups from organic metabolites and pro- 
teins. However, both of these approaches suffer from^the 
same inherent problem: they are unspecific and a range of 
cellular reactions/concentrations may be affected. For 

25 instance, the acid phosphatase will remove phosphate 
groups from essential metabolites and proteins, thus dis- 
turbing various metabolic fluxes and metabolic regula- 
tion. The uncoupling of the plasma membrane H + -ATPase 
will disturb the intracellular pH in addition to the gra- 

30 dient of numerous ions across the cytoplasmic membrane . 
Besides, the addition of chemicals such as dinitrophos- 
phate is undesirable for most purposes. 

SUMMARY OF THE INVENTION 

35 

The idea of the invention is to use a highly specific and 
clean way to increase the intracellular level of ADP, 
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which does not suffer from the limitations described 
above: to express in a well-controlled manner an enzyme 
that has ATP-hydrolytic activity in the living cell with- 
out producing other products and without coupling this 
5 activity to energy conservation- Such an enzymatic activ- 
ity is of course not likely to be found in a normal cell, 
because the cell would then loose some of its vital en- 
ergy reservoir. 

10 Accordingly the present invention provides a method of 
improving the production of biomass or a desired product 
from a cell, the method being characterized by expressing 
an uncoupled ATPase activity in said cell to induce con- 
version of ATP to ADP without primary effects on other 

15 cellular metabolites or functions , and incubating the 
cell with a suitable substrate to produce said biomass or 
product. 

One of the normal enzymes that comes closest to the 
20 ideal ATP-hydrolyzing enzyme, is the membrane bound H - 
ATPase . This huge enzyme complex consists of two parts , 
the membrane integral part (F 0 ) and the cytoplasmic part 
(Fi) . Together the two parts couples the hydrolysis of 
ATP to ADP and inorganic phosphate (Pi), to translocation 
25 of protons accross the cytoplasmic membrane, or vice 
versa, using the proton gradient to drive ATP synthesis 
from ADP and Pi- 

The method of the invention is conveniently carried out 
30 by expressing in said cell the soluble part (Fi) of the 
membrane bound (FqFi type) H + -ATPase or a portion of the 
Fi exhibiting ATPase activity. 

The membrane bound H + -ATPase complex is found in similar 
35 form in prokaryotic as well as eukaryotic organisms, and 
thus Fi and portions thereof expressing ATPase activity 
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can be expressed in both prokaryotic and eukaryotic 
cells. 

The organism from which the Fl ATPase or portions thereof 
5 * is derived, or in which the Fl ATPase or portions thereof 
is expressed, may be selected from prokaryotes and eu- 
karyotes, in particular from bacteria and eukaryotic mi- 
croorganisms such as yeasts, other fungi and cell lines 
of higher organisms, in particular bakers and brewers 
10 yeast. 

A particularly interesting group of prokaryotes to which 
the method according to the invention can be implemented, 
i.a. in the dairy industry, are lactic acid bacteria of 

15 the genera Lactococcus , Streptococcus , Enterococcus , Lac- 
tobacillus and Leuconostoc , in particular strains of the 
species Lactococcus lactis and Streptococcus thermophi- 
lus. Other interesting prokaryotes are bacteria belonging 
to the genera Escherichia, Zymomonas , Bacillus and Pseu- 

20 domonas , in particular the species Escherichia coli f Zy- 
momonas mobilis , Bacillus subtilis and Pseudomonas pu- 
tida. 

In an expedient manner of carrying out the method accord- 
25 ing to the invention the cell is transformed or trans- 
fected with an expression vector including DNA encoding 
Fi or a portion thereof exhibiting ATPase activity under 
the control of a promoter functioning in said cell, and 
said DNA is expressed in the cell. Said DNA encoding Fi 
30 or a portion thereof may be derived from a prokaryotic or 
a eukaryotic organism, and it may be either homologous or 
heterologous to said cell. 

The Fi part of the bacterial H + -ATPase complex consists 
35 of several subunits that together are responsible for 
catalyzing ATP hydrolysis: the p-subunit is thought to 
carry the actual hydrolytic site for ATP hydrolysis, but 
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in vitro ATPase activity requires that the p-subunit 
forms a complex together with the a- and y-subunit 
(<*3Yp3)* T ^e activity of this complex is modulated by the 
e-subunit, so that the in vitro activity of the ct3Yp3e 
5 complex is five fold less than the (X37P3 complex. 

In a specific embodiment of the method according to the 
invention said DNA encoding Fi or a catalytic active por- 
tion thereof, is derived from Escherichia coli f Strepto- 

10 coccus thermophilics or Lactococcus lactis and is selected 
from the group consisting of the gene encoding the Fi 
subunit P or a catalytically active portion thereof and 
various combinations of said gene or portion with the 
genes encoding the Fi subunits 8, a, y and e or catalyti- 

15 cally active portions thereof. 

In particular said DNA encoding Fi or a portion thereof 
may be selected from the group consisting of the Es- 
cherichia coli g Streptococcus thermophilus and Lactococ- 
20 cus lactis genes a tpHAGDC (coding for subunits 5, a, y, 
p, s) , atpAGDC (coding for subunits a, y, p, e), atpAGD 
(coding for subunits a, y, p), atpDC (coding for subunits 
p, 8) and atpD (coding for subunit 0 alone) . 

25 Particularly interesting eukaryotes are the yeasts Sac- 
charomyces cerevisiae, Phaffia rhodozyma or Trichoderma 
reesei, and the DNA encoding Fi or a portion thereof may 
be derived from such organisms and is selected from the 
group consisting of the gene encoding the Fi subunit p or 

30 a portion thereof and various combinations of said gene 
or portion with the genes encoding the other Fi subunits 
or portions thereof. 

Vectors including DNA encoding the soluble part (Fi) of 
35 the membrane bound (F0F1 type) H + -ATPase or a portion of 
Fi exhibiting ATPase activity, derived from the lactic 
acid bacteria Lactococcus lactis and Streptococcus ther- 



WO 98/10089 




PCT/DK97/00373 



mophilus and from the yeasts Saccharomyces cerevisiae , 
Phaffia rhodozyma or Trichoderma reesei are also com- 
prised by the invention as well as expression vectors in- 
cluding such DNA under the control of a promoter capable 
5 'of directing the expression of said DNA in a prokaryotic 
or eukaryotic cell. 

Specific vectors according to the invention are plasmids 
including DNA encoding the soluble part (Fi) of the mem- 

10 brane bound (FoFi type) H + -ATPase or a portion of F^ ex- 
hibiting ATPase activity, said DNA being derived from 
Lactococcus lactis subsp. cremoris (SEQ ID No. 1), Lacto- 
coccus lactis subsp. lactis (SEQ ID No. 6), Streptococcus 
thermophilus (SEQ ID No. 10), Phaffia rhodozyma (SEQ ID 

15 No. 14), and Trichoderma reesei (SEQ ID No. 16). 

Further, the invention provides a method of optimizing 
the formation of biomass or a desired product by a cell, 
the method being characterized by expressing different 

20 levels of uncoupled ATPase activity in the cell, incubat- 
ing the cell on a suitable substrate, measuring the con- 
version rate of substrate into biomass or the desired 
product at each level of ATPase expression, and choosing 
a level of ATPase expression at which the conversion rate 

25 is optimized. 

Often, but not always, the optimization of a given prod- 
uct flux produced by a cell will entail the attainment of 
either maximum or minimum conversion rate of a substrate. 

30 

In an expedient manner of practicing this method of the 
invention a number of specimens of said cell are trans- 
formed or transfected with their respective expression 
vector each including DNA encoding a different portion of 
35 the cytoplasmic part (Fi) of the membrane bound (FoFi 
type) H + -ATPase up to and including the entire Fi, each 
portion exhibiting ATPase activity, said DNA in each ex- 
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pression vector being under the control of a promoter 
functioning in said ceil, incubating each cell specimen 
on a suitable substrate, measuring the conversion rate of 
substrate into biomass or the desired product in each 

5 specimen, and choosing a specimen yielding an optimal 
conversion rate. In a particular embodiment of this man- 
ner, which is especially suited for scientific studies, 
the promoter in each expression vector is an inducible 
promoter, and each cell specimen is grown at different 

10 concentrations of inducer in order to fine-tune the opti- 
mal conversion rate. 

In a preferred manner of practicing the above method of 
optimizing the performance of a cell a number of speci- 
15 mens of said cell are transformed or trans fected with 
their respective expression vector including DNA encoding 
a portion of the cytoplasmic part (Fi) of the membrane 
bound (FqFi type) H + -ATPase up to and including the en- 
tire Fi, said portion exhibiting ATPase activity, said 
20 DNA in the respective expression vectors being under the 
control of each of a series of promoters covering a broad 
range of promoter activities and functioning in said 
cell, incubating each cell specimen on a suitable sub- 
strate, measuring the conversion rate of substrate into 
25 biomass or the desired product by each specimen, and 
choosing a specimen yielding an optimal conversion rate. 
In a more preferred embodiment of this manner, which is 
well suited to establish an optimal production strain, 
the respective expression vectors include DNA encoding 
30 different such portions of Fi up to and including the en- 
tire Fi, each DNA in respective expression vectors being 
under the control of each of a series of promoters cover- 
ing a broad range of promoter activities and functioning 
in said cell. 

35 

Also in this method of the invention the DNA encoding a 
portion of Fi up to and including the entire Fi may be 
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derived from a prokaryotic or a eukaryotic organism, and 
it may be either homologous or heterologous to said or- 
ganism. The specific DNAs mentioned above may also con- 
veniently be employed in this method. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. A linear representation of the plasmids con- 
structed for modulating the cellular [ATP)/[ADP] ratio in 
10 E. coll (not drawn to scale). 

Figure 2 . Effect of induction of Fi-ATPase activity on 
the growth of E. coll in batch culture- Cells were grown 
for more than 10 generations in minimal medium supple- 
15 mented with glucose (0.4 g/1), ampicillin (0.1 g/1) and 
the indicated concentration of inducer, IPTG. 

Figure 3. Effect of ATPase expression on the intracellu- 
lar concentration of ATP and ADP (concentration in arbi- 
20 trary units), and on the ratio [ATP] /[ADP]. 

Figure 4 Effect of increased ATPase expression on ^the 
glycolytic flux. 

25 DETAILED DESCRIPTION OF THE INVENTION 

Many biosynthetic reactions in the living cell (anabo- 
lism) , require an input of free energy (ATP), which is 
generated through a series of degrading reactions (cata- 

30 bolism) . In the aerobic cell, there are two routes for 
ATP synthesis: 1) substrate level phosphorylation, where 
an energy rich phosphoryl group is transferred directly 
from a high energy intermediate metabolite to ADP, and 2) 
oxidative phosphorylation, where the free energy is first 

35 transformed into redox free energy by oxidizing the en- 
ergy source, then into a proton gradient by respiration 
and finally the proton gradient is used by the H + -ATPase 
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to drive ATP synthesis from ADP and inorganic phosphate. 
In other cases, e.g. anaerobic growth, there is only the 
first route, substrate level phosphorylation, that can be 
used for ATP synthesis. An example hereof is the homolac- 

5 tic LAB, where lactose is converted through the glyco- 
lytic pathway to lactic acid, which is excreted from the 
cells and thereby lowers the pH of the growth medium 
(usually milk products). With respect to ATP generation, 
homolactic fermentation is a very inefficient process, 

10 and only four moles of ATP are produced from 1 mole of 
lactose through substrate level phosphorylation. 

The anabolic (ATP consuming) and catabolic (ATP produc- 
ing) fluxes are normally well balanced in the living 
15 cell, and therefore, in the wild-type cell under normal 
growth conditions, the catabolic fluxes will be propor- 
tional to the anabolic fluxes. If a reaction is intro- 
duced that for instance hydrolyzes ATP in the cell and 
thereby lowers the cellular energy state (i.e. the 
20 [ATP]/[ADP] ratio), then either catabolism should in- 
crease or anabolism (growth) should decrease in order to 
make the consumption rate equal the production rate 
again. Which of these two scenarios will take place de- 
pends on whether, initially, the growth rate of the cell 
25 is limited through anabolism or through catabolism, i.e. 
whether there is a surplus or a shortage of energy in the 
cell to begin with. If there is a shortage of energy, 
then the rate of the anabolic reactions is limited by ca- 
tabolism and these reactions will be sensitive to changes 
30 in the cellular energy state. Introduction of an ATP- 
hydrolyzing reaction is then most likely to affect the 
growth rate of the cells. On the other hand, if there is 
a surplus of energy, then the growth rate will be limited 
mainly by the anabolic reactions; the. rate of anabolism 
35 will be insensitive to a decrease in the energy state, 
but the catabolic rate may increase due to a decrease in 
product inhibition at lower [ATP] /[ADP] ratio. 
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In vitro, the F± part of the H + -ATPase complex has been 
shown to have ATPase actitity, see above. But so far no- 
body has managed to use the F\ complex to stimulate the 
glycolytic flux, or even to show that the Fi complex can 
5 hydrolyze ATP in intact cells. Indeed, when we first 
tried to overexpress the Fl complex, consisting of the 
genes for the subunits a, y, P and e, this had virtually 
no effect on the growth of E. coll, even when the genes 
were transcribed from the maximally induced tac promoter 
10 and on a very high copy number vector (derived from 
pUC18) . One skilled in the art of gene expression in E. 
coll will appreciate that this combination is one of the 
most efficient expression systems that exists for this 
organism. 

15 

We then decided to try to express different combinations 
of subunits of the Fl complex, in order to see if other 
combinations of subunits would be more powerful . Plasmids 
were constructed containing various combinations of the 

20 genes encoding the Fi part of the bacterial FiFo -ATPase 
complex from E. coll. The genes were expressed, either 
from an inducible (lac-type) promoter at various concen- 
trations of inducer or from a series of constitutive pro- 
moters of varying promoter activity. These plasmids 

25 should express various levels of ATPase activity when in- 
troduced into the bacterial cell. Depending on which Fi 
genes are present on the plasmid and the strength of the 
promoter which is used to drive the expression,, we ob- 
served various degrees of inhibition of the growth of the 

30 cells harbouring these plasmids. Surprisingly, the beta 
subunit alone and in combination with the epsilon subunit 
turned out to be far more active in vivo than the entire 
Fl complex . 

35 The objective of this work was to affect the energy state 
of the cells, as reflected in the ratio [ATP]/[ADP], We 
therefore measured the intracellular concentration of ATP 
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and ADP in growing cells expressing various activities of 
Fi-ATPase. Indeed the ATP concentration decreased 
slightly with increasing ATPase activity and the ADP con- 
centration increased, and therefore the [ATP] /[ADP] ratio 

5 decreased (the effect on the ATP concentration was less 
than the effect on the ADP concentration as expected, see 
above). We also calculated the glycolytic flux through 
the cells with various levels of ATPase activity. We 
found that the flux through the glycolytic pathway was 

10 first stimulated with increasing expression of ATPase ac- 
tivity, until, a certain (optimal) ATPase activity which 
gave maximal glycolytic flux. Further increase of ATPase 
expression resulted in a lower glycolytic flux, due to a 
secondary effect of the ATPase activity on the growth of 

15 the cells. This emphasizes the need for optimization of 
gene expression rather than merely overexpressing the 
genes . 



EXAMPLE 1 

20 

,ATP hydrolysis and enhanced glycolytic flux in Escheri- 
chia coli, using an inducible promoter 

Restriction enzymes, T4 DNA polymerase, calf intestine 
25 phosphatase (CIP) were obtained from Pharmacia. 

Procedures for DNA isolation, cutting with restriction 
enzymes, filling in sticky DNA ends with T4 DNA po- 
lymerase in the presence of dATP , dCTP, dGTP and dTTP, 
30 treatment with calf intestine phophatase to remove phos- 
phate groups from 5' DNA ends and ligation of DNA frag- 
ments are carried out by standard methods as described by 
Maniatis et al. , 1982 . 



WO 98/10089 




PCT/DK97/00373 



13 



Extraction and measurement of ATP and ADP 

0.9 ml of cell culture was mixed with 0-9 ml of (80 °C) 
phenol (equilibrated with 10 mM Tris, 1 niM EDTA pH=8) and 
5 immediately vortexed vigorously for 10 seconds. After 1 
hour at room temperature the sample was vortexed again 
for 10 seconds and the two phases were separated by cen- 
trifugation at 14000 rpm for 15 minutes, and then resid- 
ual phenol in the water phase was removed by extraction 

10 with 1 volume of chloroform. ATP and ADP concentrations 
were then measured, using a lucif erin-lucif erase ATP 
monitoring kit (obtained from and used as recommended by 
LKB, except that 3 mM of phosphoenol-pyruvate was added). 
[ATP] was measured first. Subsequently the , ADP in the 

15 same sample was converted to ATP by adding pyruvate 
kinase, and [ADP] was recorded as the concomitant in- 
crease in luminescence. 

Construction of plasmids carrying combinations of the E . 
20 coli atp genes 

The following combinations of E. coli genes coding for-Fi 
subunits were chosen for expressing ATPase activity in E. 
coli: 1. atpAGDC (subunits a, y, p, e), 2. atpAGD (sub- 
25 units a, y, P), 3 . atpDC (subunits p, e), and 4. atpD 
(subunit P alone) . 

Cloning of fragments carrying atp genes onto pUC19 

30 The plasmid pBJC917 (von Meyenburg, K., et. al. f 19 84 ) 
which carries the entire atp operon was cut with 

1) the restriction enzyme Drain , and a 5009 bp DNA frag- 
ment containing the atpAGDC genes was isolated; 

35 

2) the restriction enzymes Drain and Tthllll, and a 4106 
bp DNA fragment containing the atpAGD genes was isolated; 
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3) the restriction enzymes Drain and SacII, and a 2364 
bp DNA fragment containing the atpDC genes was isolated; 

4) the restriction enzymes Aval and Tthllll, and a 1472 
5 * bp DNA fragment containing the atpD gene was isolated. 

In all four cases the fragments were then treated with T4 
DNA polymerase to create blunt ends, and subsequently the 
fragments were ligated into the cloning vector pUC19 
10 (Yanisch-Perron et ai., 1985) which had been cut with Smal 
and treated with CIP. 

The four ligation mixtures were transformed into the E. 
coli strain JM105 (Yanisch-Perron et ai., 1985), and the 

15 transformation mixtures were plated on LB ( Luria-Bertani 
broth; Maniatis et ai., 1982) plates containing 100 ng/ml 
ampicillin and 75 ng/ml 5-bromo-4-chloro-3-indolyl-p-D- 
galactoside (X-gal). In this strain background (JM105), 
plasmids formed by religation of pUC19 will give blue 

20 colonies , whereas plasmids that carry foreign DNA frag- 
ments inserted into the Smal site of pUC19, will give 
white colonies, A number of white colonies from the four 
transformations were therefore picked for further analy- 
sis: plasmid DNA was isolated and analysed by cutting 

25 with various restriction enzymes. Clones were identified 
from each of the four series which had the desired frag- 
ment inserted into the Smal site of pUC19, and in the 
proper orientation. These four plasmids were named, re- 
spectively: pATP-AGDC, pATP-AGD, pATP-DC and pATP-D, with 

30 reference to the specific atp genes carried by the plas- 
mid . 

Cloning combinations of the atp genes under the control 
of an inducible (tac) promoter 

35 

In order to be able to control the expression of the ATP- 
ase activity, we selected the expression vector pTTQ18 
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(Starck, 1987 ). This vector is a derivative of pUC18 
(Yanisch-Perron et ai., 1985), which carries a tac pro- 
moter and the lactose repressor gene, lacl. Immediately 
downstream of the tac promoter is a multiple cloning site 
5 (MCS; the polylinker from pUC18) in which genes can be 
inserted to be expressed from the tac promoter. The tac 
promoter is of the iac-type, i.e. repressed by the lac- 
tose repressor and inducible with isopropyl-{3-D- 
thiogalactoside (IPTG). 

10 

The four plasmids, pATP-AGDC, pATP-AGD, pATP-DC and pATP- 
D were cut with Kpnl and Xbal 9 which gave the four DNA 
fragments, 5023, 4120, 2378 and 1486 respectively. After 
purification, the fragments were ligated into the cloning 

!5 vector, pTTQ18, which had also been cut with Kpnl and 
Xbal (see figure 1). The ligation mixtures were trans- 
formed into E. colx K-12 MC1000 (Casabadan and Cohen, 
1980), and the transformation mixtures were plated on LB 
plates containing 100 ng/ml ampicillin. A number of colo- 

20 nies from the four transformations were therefore picked 
for further analysis: plasmid DNA was isolated and ana- 
lysed by cutting with various restriction enzymes. Clones 
were identified from each of the four series which had 
the desired fragment inserted into the MCS of pTTQ18 in 

25 the proper orientation. These four plasmids were named, 
respectively: pTAC-AGDC, pTAC-AGD, pTAC-DC and pTAC-D, 
with reference to the specific atp genes carried by these 
plasmid and the tac promoter used for their expression. 
For the purpose of subsequent physiological studies, the 

30 plasmids were transformed into the E. coll K-12 strain 
LM3118, which is used routinely for physiological experi- 
ments in this laboratory. The corresponding names for the 
LM3118 strain carrying these four plasmids are PJ4332, 
PJ4333, PJ4335 and PJ4334, respectively. 



35 
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Effect of induction of ATPase activity on the growth of 
E. coli on plates 

The strains containing the four plasmids were streaked on 

5 LB plates containing ampicillin (100 *xg/ml) and 1 mM of 
IPTG which should give maximum expression from the tac 
promoter. Table I shows how the four strains responded: 
the strain carrying plasmid pATP-AGDC, which contains the 
genes for the four subunits, a, y, p and e, was only very 

10 slightly affected in growth, even in the presence of 1 mM 
IPTG* The other three plasmids, pTAC-AGD, pTAC-DC and 
pTAC-D caused severe growth inhibition in the presence of 
1 mM IPTG, where colonies were no longer visible. With 
intermediate concentrations of IPTG, 0.01 mM and 0.1 mM, 

15 the plasmids affected the growth of their host cells to 
different extents: pTAC-AGD was the most active, giving 
rise to a strong inhibition of growth already with 0.01 
mM IPTG, a concentration which gave only a slight inhibi- 
tion with the plasmid pTAC-DC and no inhibition of the 

20 strain with pTAC-D. With 0.1 mM IPTG, colonies were 
hardly visible for the strain that carried the pTAC-AGD, 
the plasmid pTAC-DC caused strong growth inhibition, 
whereas the effect of pTAC-D was significant but small. 

Table I 

Strain Plasmid - IPTG 0.01 mM IPTG 0.1 mM IPTG 1 mM IPTG 



PJ4332 p TAC -AG DC ++++ ++++ ++ * + +++ 

PJ4333 pTAC-AGD ++++ ++ + 

PJ4335 pTAC-DC ++++ +++ + 

PJ4334 pTAC-D ++++ ++ 

++++ = normal colony size; +++ = slight inhibition; ++ = 1/2 normal size; 



+ = 1/10 normal size; - = no growth 



The effect of ATPase expression from the four plasmids 
above was also studied in the E, coli mutant LM3115, in 
which the entire atp operon on the chromosome is deleted, 
but which grows with almost wild- type growth rate on LB 
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medium. With this strain transformed with the four plas- 
mids we observed a similar pattern of growth inhibition 
on LB plates as a function of IPTG concentration. This 
shows that the effect of ATPase expression was independ- 
5 ' ent of the presence of the normal atp operon. 

Effect of induction of ATPase activity on the growth of 
E* coll in liquid cultures 

10 The effect of induction of ATPase was also studied with 
cells grown in liquid cultures. For this purpose we chose 
.the strain PJ4333, carrying the plasmid pTAC-AGD, because 
this plasmid appears to be the most active with respect 
to the inhibitory effect on the of growth . of E. coll. 

15 Figure 2 shows the growth of PJ4333 in minimal medium 
supplemented with a limiting concentration of glucose 
(0.4 g/1) and ampicillin (0.1 g/1), without IPTG and in 
the presence of increasing concentrations of IPTG. We ob- 
served that the growth rate of the strain was practically 

20 constant (within some 10%) with increasing amounts of 
IPTG up to about 30 *iM. At higher than 40 *iM IPTG,^ the 
growth of the cells were slightly inhibited , in accor- 
dance with the experiments on platfes, see above. 

25 However, what was affected was the final density of cells 
that one obtains from the limited amount of glucose that 
was included in each culture: The more ATPase that is ex- 
pressed in the cells, the lower the yield of cell mass. 
Apparently, the cells become less economic with respect 

30 to converting the glucose into biomass, or in other words 
they consume more glucose per cell synthesized . If this 
is due to the expression of ATPase activity, then we 
would expect to see an effect hereof on the energy state 
of the cells. We therefore measured the concentrations of 

35 ATP and ADP in the cells growing with different expres- 
sion levels of ATPase activity. 
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Indeed, the intracellular ATP concentration decreased 
gradually and the ADP concentration increased, with in- 
creased expression of ATPase; therefore the [ATP] /(ADP] 
ratio decreased with increased expression of ATPase, 

5 which imply that the increased glucose consumption is the 
result of increased ATP convertion to ADP, see figure 3. 
The actual flux of glucose through the cells (Jgiuc/ nunol 
glucose / g cell dry weight 7 hour) is also interesting, 
because this value tells us whether the performance of 

10 the cell increased as the ATPase activity increased. 
Jgluc can be calculated from the yield, Y (g cell dry 
weight / mol glucose) and the specific growth rate of the 
culture, \x (1/hours): 

15 Jgluc 3 H/Y 

Figure 4 shows how the flux of glucose changed as the ac- 
tivity of ATPase increased: the glycolytic flux increased 
gradually as the ATPase expression increased, until a 

20 maximum was reached (at 30 nM IPTG) . Further increase of 
ATPase expression had a slightly negative effect on the 
glucose flux. This was probably because the energy state 
of the cells became so low that this had a negative ef- 
fect on some anabolic reactions, since the growth rate 

25 was lower for the culture that was grown in the presence 
of 40 nM IPTG. 

The expression of subunits of the Fi part of the bacte- 
rial H + -ATPase lowers the energy state of the bacterial 

30 cell. This is due to hydrolysis of ATP into ADP and Pi- 
The expression of ATPase activity does not affect the 
growth rate of E* coll much at low levels of expression/ 
but the efficiency by which the substrate is converted 
into biomass was strongly reduced. Under the set of con- 

35 ditions used here, the expression of ATPase activity has 
a stimulatory effect on the rate by which the cells con- 
sumes the exogenous glucose. 
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EXAMPLE 2 

Expression of Fi-ATPase activity from constitutive pro- 
moters in E. aoli 

5 ' 

In example 1 we used a lac-type promoter system to modu- 
late the expression of the Fi ATPase subunits in E. coli. 
However, for the optimization of gene expression for in- 
stance in industrial bioreactors or for the use in fer- 

10 mented food products, the use of lac type promoters is 
not always feasible. In this example we illustate the op- 
timization of Fi -ATPase expression in E. coll, using a 
series of constitutive promoters of different strength, 
to control the expression of the atpAGD genes which here 

15 originates from E. coli. The constitutive promoters (CP 
promoters) were selected from a library of artificial 
promoters which had previously been cloned onto a shuttle 
vector for E. coll and L. lactls, pAK80 (Israelsen et 
al. 9 1995) as described in our co-pending PCT patent ap- 

20 plication PCT/DK97/00342 . The selected plasmid deriva- 
tives of pAK80 were pCP34, pCP41 and CP44 (CPX cloning 
vectors). The atpAGD fragment from pTAC-AGD (from example 
1) was first subcloned in a polylinker in order to have 
the atpAGD fragment flanked by two BamHI sites, Subse- 

25 quently, this BamHI fragment was cloned into the unique 
BamHI site downstream of the CP promoters on the plasmids 
pCP34, pCP41 and CP44, resulting in the plasmids, 
pCP34: : atpAGD, pCP34 : : latpAGD, pCP4 1 :: atpAGD and 

CP44 atpAGD, where pCP34 : : 2 atpAGD contains two atpAGD 

30 fragments in tandem. 

Subsequently, the strains were characterized with respect 
to growth rate, growth yield and glycolytic flux in glu- 
cose minimal medium supplemented with 200 ^ig/ml erythro- 
35 mycin, essentially as described in example l f see table 
2. 
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The expression of the Fi-ATPase subunits had a s Light ly 
negative effect on the growth rate as the expresion level 
increased . The effect on growth yield was much stronger 
and at the highest expression level the growth yield had 
5 dropped to 40 % of the initial value. The glycolytic flux 
was stimulated 70% at the highest expression level of AT- 
Pase, and at this expression level the growth rate was 
lowered by 30%. 

Table 2. Effect of expression of uncoupled F^-ATPase activ- 



ity (£. coli 


a, y f P 


subunits ) 


in E. coli 






Plasmid 


Biomass 


Growth 


Glucose 


Biomass 


Growth 


Glucose 




yield 


rate, u 


flux 


yield 


rate 


flux 




gdw/ mmol 


h-1 


mmol glu- 










glucose 




cose/h/ gdw 








pCP41 


0,067 


0, 47 


6,9 


100 


100 


100 


pCP41 : : atpAGD 


0, 047 


0,42 


9,1 


69 


90 


131 


pCP34 


0, 063 


0,41 


6, 6 


100 


100 


100 


pCP34 : : atpAGD 


0,034 


0, 34 


9,9 


54 


81 


149 


pCP44 


0, 067 


0,44 


6,5 


100 


100 


100 


pCP44 : : atpAGD 


0, 027 


0, 30 


11,2 


40 


69 


172 



EXAMPLE 3 

Expression of E. coli Fi-ATPase activity from constitu- 
5 tive promoters in L. lactis. 

The plasmids from example 2 which express the E. coll 
Fi-ATPase subunits to various extent are also capable of 
replicating in L. lactis, and could therefore be used to 
10 test whether the E. coll Fi-ATPase subunits can be used 
to hydrolyse ATP in L. lactis. 



WO 98/10089 




PCT/DK97/00373 



The plasmids pCP34 : : atpAGD, pCP34 : : 2atpAGD and 
pCP4 1 : : atpAGD, were transformed into the L. lactis sub- 
species cremoris strain, MG136 3, which is used routinely 
for physiological experiments in this laboratory. In ad- 
5 'dition we transformed the respective vectors, pCP34 and 
pCP41 in order to have proper control strains. Subse- 
quently, the resulting- trans formants were characterized 
with respect to growth rate, growth yield and glycolytic 
flux, in comparison to the respective vectors, pCP34 and 
10 pCP41, by growing the various cultures in defined medium 
(SA medium) supplemented with a limiting concentration of 
glucose (0.1%), see table 3. 



Table 3. Expression of E. coll Fi-ATPase in L . lactis 



Plasmid 


Bioraass 
yield 

gdw/mmol 
glucose 


Growth 
rate, p 

h-1 


Glucose 
flux 

mmol giu- 
cose/h/gdw 


Bioraass 
yield 

% 


Growth 
rate 

> 


Glucose 
flux 


pCP34 




0, 073 


0, 664 


9, 161 


100 


100 


100 


pCP34 : 


: a tpAGD 


0, 071 


0, 653 


9, 230 


97, 5 


98,3 


100, 8 


pCP34: 


: 2 atpAGD 


0, 069 


0, 655 


9, $60 


94, 6 


98,7 


104, 4 


pCP41 




.0, 072 


0, 645 


8,925 


100 


100 


100" 


pCP41: 


: a tpAGD 


0, 070 


0, 590 


8, 461 


96, 5 


91,5 


94, 8 



The results show that the plasmids pCP34 :: atpAGD and 
pCP34 : : 2atpAGD did affect the growth yield and the glyco- 
lytic flux to some extent, but the plasmids were far less 

5 efficient in L. lactis, compared to E. coll. This was 
probably a consequence of a lower expression of the E. 
coll ATPase subunits, or some of these, in L. lactis, due 
to a lower copy number of the pAK80 vector in L. lactis 
(5-10), and due to differences in the trans lational 

0 effciency of the three individual atp genes which origi- 
nates from E. coli. The plasmid pCP4 1 :: atpAGD also re- 
sulted in a lower growth yield, indicating that also in 
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this case uncoupled ATP hydrolysis was taking place. How- 
ever, the pCP4 1 : : atpAGD plasmid had a relatively strong 
inhibitory effect on the growth rate and therefore the 
glycolytic flux was not increased by this plasmid. It is 
5 possible that the heterologous expression of E . coli AT- 
Pase subunits resulted in growth inhibition due to ef- 
fects other than ATP hydrolysis, e.g. by interfering with 
the function of the L. lactis F\Fq H + -ATPase complex. 

10 EXAMPLE 4 

Expression of L. lactis Fi-ATPase subunits p and e, in L, 
lactis. 

15 In the example above we showed that the expression of Fi- 
ATPase subunits from E. coli in L. lactis , resulted in 
only a small stimulation of the glycolytic flux. It is 
possible that the heterologous expression of E. coli AT- 
Pase subunits resulted in growth inhibition due to ef- 
20 fects other than ATP hydrolysis, e.g. by interfering with 
the function of the L. lactis FiFq H + -ATPase complex. In 
the present example we have expressed the L. lactis Y\- 
ATPase subunits, P and e, in L. lactis, as this appeared 
to be an effective combination of subunits when expressed 
25 in E. coli, see example 1. The a tpDCLlc genes from L. 
lactis subspecies cremoris (SEQ ID No. 1) was cloned on a 
2.5kb Hindlll fragment into the Hindlll restriction site 
on the standard cloning vector, pBluescript, into E. coli 
K-12, strain BOE270. Subsequently, the atpDCLlc genes 
30 were cut out on a 2.5kb BamHI-Sall fragment and cloned 
into 5 expression vectors, pCP32, pCP34, pCP37, pCP41 and 
pCP44 which had been digested with BawHI and Sail, re- 
sulting in the plasmids pCP32 : : a tpDCLlc / pCP34 : : a tpDCLlc / 
pCP37 : :atpDC L lcf pCP4 1 : : a tpDC L lc and pCP44 : : a tpDCLlc / re ~ 
35 spectively, where the lacLM genes downstream of the CP 
promoters, have been replaced with the atpDCLlc genes. 
These plasmids should express the L. lactis Fi-ATPase 
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subunits, P and e, to different extent. The plasmids were 
then transformed into MG1363 with selection for the 
erythromycin resistance carried by these vectors. Experi- 
ments were then performed to test whether the constructs 

5 'resulted in conversion of ATP into ADP in L. lactis. The 
strains carrying the different constructs was then grown 
in GM17 medium supplemented with 5 (jig/ml erythromycin. 
The plasmids did not have a strong effect on the growth 
rate of the cultures, which remained close to the growth 

10 rate of the respective vector control plasmids. The yield 
of biomass, however, decreases for all the cultures by up 
to 17%, which shows that the cultures did indeed express 
uncoupled ATPase activity, see table 4. 

15 Table 4. Effect of expression of L. lactis P and e 

subunits on acid production by L . lactis, at 30°C and ^ 
with initial pH 6.7. 



Plasmid 


Biomass' 

OD 4 so 


Final pH* 


Acid formation, 
relative to biomass 

£ of vector 


pCP34 




5 


08 


4,27 


100 


pCP34 


: : atpDCllc 


4 


72 


4.31 


98.0 


pCP41 




4 


66 


4 . 34 


100 


pCP41 


: : atpDCllc 


5 


21 


4.24 


113.5 


pCP37 




4 


89 


4.28 


100 


pCP37 


: : atpDCllc 


4 


63 


4 .24 


116.1 


pCP32 




4 


86 


4 . 34 


100 


pCP32 


: : atpDCllc 


3 


95 


4 . 36 


116.9 



Each value is the average of 3-4 independent cultures. The acid 
20 production was calculated from the pH change, and normalized by the 
biomass produced. 

The GM17 growth medium used in these experiments contains 
a surplus of glucose (1%) , and growth only stops when the 
25 pH of the growth medium becomes lower than approximately 
pH 4.3. To some extent, this mimics the situation that 
the lactic acid bacteria experience during cheese and 



WO 98710089 



24 



PCT/DK97/00373 



yougurt production. In this medium, the growth yield, in 
terms of the final cell mass of the cultures, reflects 
the acid production by. these cultures. 

5 In these cultures, the expression of Fi-ATPase subunits 
will increase three fold at approximately OD600 equal to 
1.5. This is a consequence of the three fold amplifica- 
tion of the plasmid copy number that has been shown to 
take place at this point of the growth curve. In reality, 

10 the effect of expressing the Fi-ATPase subunits may 
therefore be larger. 

To test this hypothesis, we grew some of the strains 
which expressed the L. 'lactis F\ -ATPase subunits p and e 
15 in batch cultures of GM17 medium which had been adjusted 
to pH 5.9, see Table 5. In addition, the temperature of 
the growth medium may also affect the plasmid copy number 
and thus the expression of the F t -ATPase subunits. The 
experiments were therefore performed at 37°C. 

20 

Table 5. Effect of expression of L. lactis p and e 
subunits on acid production by L. lactis, at 37°C and 
with initial pH 5.9. 



Plasmid 


. ! 

Biomass 

OD« 5 o 


Ffinal pH* 


Acid formation, 
relative to biomass 

r i of vector 


pCP34 




1.24 


4.95 


100 


pCP34 


: :atpDC Uc 


1.06 


4.87 


141.. 4 


pCP37 




1.00 


4.96 


100 


pCP37 


: :atpDC llc 


0. 58 


4.92 


188.4 



25 

Clearly, the effect of the Fi-ATPase activity was much 
stronger under these growth conditions: the amount of 
acid produced was almost doubled for the strain carrying 
the plasmid pCP37 : : a tpDC iic . 
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EXAMPLE 5 

Expression of the Fi-ATPase subunits, a, y, and p, from 
L* lactls subspecies cremoris in 1*. lactis subspecies 
5 cremoris . 

In example 4, only the L. lactls F i-ATPase P and z 
subunits were expressed in L. lactls. However, from the 
experiments with E. coll (example 1), we know that the 

10 simultaneous expression of subunits a, y, and p, is a 
more powerful combination, which could also be the case 
for L . lactls. In order to obtain the same strong stimu- 
lation of the glycolytic flux and acid production in L. 
lactls, a set of vectors . similar to the vectors described 

15 in example 4 was constructed, in which the atpAGE>u± c 
genes derived from L. lactls, encoding the subunits ct, : y , 
and p (SEQ ID No. 1) was expressed from CP promoters with 
different activities* The atpAGE^ic genes from L. lactls 
was cloned on a 2.5 kb BamHI-Sall fragment into the 5 

20 vectors, pCP32, pCP34, pCP37, pCP41 and pCP44, resulting 
in the plasmids, pCP32 : : a tpAGDj^ic , pCP34 : : atpAGD^iQ / 
pCP37 : :atpAGDLic, pCP4 1 : : a tpAGD^ic , pCP44 : : a tpAGC^lc / re- 
spectively, where the lacLM genes downstream of the CP 
promoters, has been replaced with the atpAGD^\ c genes. 

25 These plasmids will express the L. lactls Fi-ATPase 
subunits a, y, and p, to different extent. The plasmids 
were transformed into MG1363 with selection for the 
Erythromycin resistance carried by these vectors. Experi- 
ments were then performed to show that the constructs 

30 were effective in ATP hydrolysis in L. lactls and to what 
extent the glycolytic flux was enhanced, by growing the 
five different constructs in GM17 medium supplemented 
with erythromycin, and measuring the growth rate, ATP and 
ADP concentrations, the yield of biomass and the rate of 

35 acid production . 
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EXAMPLE 6 

Expression of Fi-ATPase subunits from L. lactls subsp 
lactis, in L. lactis subspecies lactis. 

5 

In the examples 3-5 above, we used the strain L, lactis 
subsp. cremoris, MG1363. This strain is plasmid-free and 
is used routinely in our laboratory as a simple model or- 
ganism for our physiological studies. But strains belong- 

10 ing to the subspecies lactis are also important in cheese 
production. We therefore cloned and sequenced the at- 
pAGD^n genes from L. lactis subsp. lactis, ( SEQ ID No. 
6). Subsequently, a 4.2 kb fragment habouring the at- 
pAGD^n genes was cloned into 5 vectors, pCP32, pCP34, 

15 pCP37, pCP41 and pCP44, resulting in the plasmids, 
pCP32: ratpAGDLii, pCP34 : zatpAGD^n , pCP37 : : a tpAGD^n , 
pCP41: iatpAGDi.il r pCP44 : : atpAGD^n , respectively. These 
plasmids were then transformed into L. lactis subsp. lac- 
tis as described in example 3. The resulting strains with 

20 different expression levels of the Fi-ATPase subunits a, 
y and p were then used to characterize the effect hereof 
on the growth yield, growth rate, glycolytic flux, and 
the cellular energy state of L. lactis subsp. lactis, as 
described in the examples 1-5. 

25 

EXAMPLE 7 

Expression of Fl-ATPase 6ubunits from S. thermophilus, 
ST3 , in S. thermophilus, ST 3 

30 

In the examples 3-6 above, we used strains of the genus 
Lactococcus • These strains are important in cheese pro- 
duction. As starter cultures for yougurt production, the 
dairy industry often uses strains of S. thermophilus . We 
35 therefore cloned and sequenced the atpAGDst genes from S. 
thermophilus , strain ST3 (SEQ ID No. 10). Subsequently, a 
4.2 kb fragment habouring the atpAGDst genes was cloned 
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into the 5 vectors, pCP32, pCP34, pCP37, pCP4 1 and pCP44, 
resulting in the plasmids, pCP32 : : a tpAGD^ t 

pCP34 : latpAGDst* pCP37 : : atpAGD$t / pCP4 1 : : a tpAGD s t / 

pCP44 : : atpAGDst / respectively. These plasmids were then 
5 ' transformed into S. thermophilus strain ST3 . The result- 
ing strains have different expression levels of the Fi- 
ATPase subunits a, y, and P, and were then used to char- 
acterize the effect hereof on the growth yield, growth 
rate, glycolytic flux, and the cellular energy state of 
10 S. thermophilus , as described in the previous examples. 

EXAMPLE 8 

Expression of a truncated F^-ATPase P subunit from Phaf- 
15 fia rhodozyma in Saccharomyces cerevisiae 

In this example we show that uncoupled F^-ATPase expres- 
sion can also be used to hydrolyze ATP in yeast cells of 
Saccharomyces cerevislae, 

20 

A cDNA gene library was prepared from total RNA, isolated 
from Phaffia rhodozyma, by cloning the cDNA fragments 
into the expression vector, pYES2.0. One of the resulting 
plasmids, pATPbeta, gave rise to an ade 4 " phenotype in the 

25 Saccharomyces cerevisiae strain, W301 , which carries a 
mutation in the ADE 2 gene. Sequencing of the clone re- 
vealed a 0.9 kb insert, which encoded a protein of 2 54 
amino acids (SEQ ID No. 14). The encoded protein had a 
very high homology to the C-terminal part of Fi-ATPase p 

30 subunits from other organisms, prokaryotic as well as eu- 
karyotic, including the P subunit from S. cerevislae (86% 
identity) . 

The ADE2 mutation results in starvation for an intermedin 
35 ate further down in the purine metabolism, AICAR (which 
under normal conditions is produced by ADE3 r two steps 
further down in this pathway). AICAR is essential for de 
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novo biosynthesis of AMP and GMP, and KDE2 mutants there- 
fore need an alternative purine source in the growth me- 
dium. However , there is an alternative route for synthe- 
sis of AICAR which involves some of the genes involved in 

5 histidine biosynthesis. These genes are normally re- 
pressed under the conditions used for the complementa- 
tion, but when the HIS3 gene is introduced on a piasmid, 
this complements the ADE2 mutation because the cells 
start to produce AICAR. Since AICAR is a precursor for 

10 ATP, it is likely that a lack of ATP (or increased levels 
of ADP and AMP) provide a signal to derepress the HIS3 
gene and generate AICAR (which will subsequently end up 
as ATP) . Indeed, cross-pathway regulation between purine 
and histidine biosynthesis has been found in yeasL and 

15 involves the transcription factors BAS1 and BAS2. A rea- 
sonable explanation for the ade + phenotype conferred by 
the piasmid, is therefore that the piasmid gives rise to 
ATP hydrolysis in the cytoplasm, thereby effecting the 
concentrations of adenine nucleotides in the cytoplasm. 

20 

Importantly, this truncated p subunit from Phaffia r/jo- 
dozyma that was encoded on pATPbeta, included the region 
of the P subunit which is thought to encode the catalytic 
site for ATP hydrolysis. The truncation of the N-terminal 
25 part of the P subunit probably means that the protein 
will no longer be exported into the mitochondrion, but 
should stay within the yeast cytoplasm. 

The . truncated P subunit pATPbeta is expressed from a gal 
30 promoter, i.e. it can be induced with galactose. If the 
truncated p subunit encoded by the clone is active in ATP 
hydrolysis it should result in a decrease in the growth 
yield, and at sufficiently high expression level, we 
should also observe inhibition of growth. The strain 
35 which expressed the truncated p subunit and a control 
strain (which contained a piasmid pHIS3 containing a HIS3 
gene from Phaffia rhodozyma) , were streaked on plates 
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containing galactose as the energy source, which will 
give maximal expression of the truncated P subunit. In- 
deed, the growth of the strain which expressed the trun- 
cated P subunit was strongly inhibited by the presence of 

5 * galactose, whereas the control strain grew normally. As a 
control, the growth of the two strains were also compared 
on a plate containing glucose as the energy source, con- 
ditions under which the expression of the P subunit 
should be repressed, and indeed we observed little dif- 

10 ference in growth of the two strains on these plates, see 
table 6 • 

Subsequently, for the purpose of the physiological inves- 
tigations, the two strains were converted into Rho~ 

15 strains (petit mutants, defective in oxidative phosphory- 
lation) by standard treatment with ethidium bromide. The 
induction with galactose caused even stronger inhibition 
of growth in the Rho" background, which further indicates 
that the cause of the growth inhibition is uncoupled ATP 

20 hydrolysis in the cytoplasm. 

Table 6. Effect of expression of a truncated Fi-ATPase P 
subunit from Phaffla rhodozyma in S. cerevlsiae cft\ SC 
plates 

25 



Strain/plasmid 


SC-ura + glucose 


SC-ura + galactose 


Rho + /pATPbeta 


+++++ 


+ 


Rho + /pHIS3 


+++ + 


+++ 


Rho~/pATPbeta 


+++ + + 




Rho~/pHIS3 


++ + + 


+++ 



Growth experiments were performed to measure the result- 
ing changes in the ATP/ADP ratio and the degree of stimu- 
lation of the glycolytic flux and ethanol formation, es- 
30 sentially as described in the examples above, and to show 
that the truncated p subunit from Phaffia rhodozyma is 
active with respect to converting ATP into ADP in the 
yeast cell. 



wo 9mww 



30 



EXAMPLE 9. 

Expression of Fi-ATPase P subunit from Trichoderma reesei 
in Saccharomyces cerevisiae. 

5 

In this example we show that the expression of the Fi- 
ATPase P subunit from the filamentous fungus, Trichoderma 
reesei can be used to improve the product formation of 
Saccharomyces cerevisiae. 

10 

The gene encoding the Fi-ATPase p subunit homologue from 
Trichoderma reesei was isolated from a cDNA library, in- 
serted into a multicopy expression vector, pAJ401. DNA 
sequencing (SEQ ID 16) revealed that the cloned gene had 

15 very high homology to the p subunits from Neurospora 
crassa (91% identity), Kluyveromyces lactis (681) and 
Saccharomyces cerevisiae (68%) . Importantly, the first 43 
amino acids in this P subunit, which encodes the signal 
for exporting the protein into the mitochondria, was ho- 

20 mologous to the N-terminal part of the p subunit from 
Neurospora crassa {58% identity), but not to that of Sac- 
charomyces cerevisiae. It is therefore likely that the p 
subunit from Trichoderma reesei will stay within the cy- 
toplasm when expressed in Saccharomyces cerevisiae. This 

25 - is important for the many cases where the fermentation is 
carried out anaerobically, because in these cases it is 
probably most efficient if the ATP hydrolysis takes place 
in the cytoplasm. Alternatively, in those cases where the 
p subunit is transported into the mitochondrion, it may 

30 be useful to genetically modify the p subunit so that is 
stays within the cytoplasm. 

The gene encoding the F^-ATPase p subunit homologue from 
Trichoderma reesei was expressed in 5, cerevisiae strain 
35 VWlb (MAT alpha, leu2-3/112, ura3-52, trpl-289, h±s3Dl, 
MAL2-8c f SUC 2) . To test whether the presence of the T. 
reesei P subunit resulted in ATP hydrolysis in the cyto- 
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plasm of the Saccharomyces cerevisiae host cells, we 
measured the intracellular concentrations of ATP, ADP and 
AMP, under various growth conditions in cultures of two 
strains expressing the P subunit (pATPP34 and pATP044) 
5 * and a strain carrying the vector plasmid, pFL60, see ta- 
ble 1. 

Table 7. Effect of expression of T. reesei P subunit on 
ATP, ADP and AMP concentrations* in S. cerevisiae 

w 



Strain 


ATP 


ADP 


AMP 


ATP/ADP ratio 




pmol/gdw 


Vimol/ gdw 


pmol/gdw 




Aerobic/exp . phase 










pATPP3 4 


19.3 


5.58 


3. 31 


3.5 


pATPP44 


13.9 


5. 15 


3.25 


2.7 


pVECTOR 


16. 6 


5.47 


3 . 43 


. 3.0 


Aerobic/s ta t .phase 










pATPP34 


9.30 


4 . 03 


2 .89 


2.3 


PATPP44 


8.99 


3. 90 


2. 42 


2.3 


pVECTOR 


19. 5 


4.62 


2.87 


4.2 


anaerobic/ s tat .phase 










PATPP34 


4.39 


11.6 


6.72 


0 . 4 


pATPP44 


3.14 


10. 5 


6. 65 


0.3 


pVECTOR 


8.84 


10. 2 


6. 37 


0.9 



according to Bergmeyer (1985) 



The P subunit did not appear to have a significant effect 
on the concentrations of ATP, ADP and AMP in cells grow- 

15 ing on glucose in the exponential growth phase. The rea- 
son is probably that the ATP concentration that the ho- 
meostatic control of ATP synthesis can here keep up with 
the extra drain on ATP conferred by the P subunit F,- 
ATPase activity. Indeed, the growth rate of these cul- 

20 tures was unaffected by the presence of the F t -ATPase ac- 
tivity, see table 7. But in the stationary cultures the 
concentration of ATP decreased significantly in the cul- 
tures expressing the P subunit, compared to the control. 
The effect was strongest in the anaerobical ly grown cul- 

25 tures where the ATP was lowered by a factor of 2-3. In 
these cultures, ATP must be generated through oxidative 
phosphorylation, (which is not even an option for the an- 
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aerobic cultures), and any effect of uncoupled ATP hy- 
drolysis should therefore indeed be stronger in these 
cells . 

5 Shake flask cultivations of cultures expressing the F x - 
ATPase P subunit homologue in Saccharomyces cerevisiae. 

Shake flask cultivations were performed under microaero- 
bic/anaerobic conditions with volume ratio 1/1.25 and no 

10 agitation; with 400 ml growth media in 500 ml Erlenmeyers 
on magnetic stirring. The growth media contained 5 g/1 of 
glucose and amino acids and bases according to synthetic 
complete medium (SOura + 0 . 5%G) . OD.-u, was monitored during 
the cultivation (OD600=1.0 is equal to 0.3 g/1 dry 

15 weight) . Ethanol and glucose were measured with HPLC 
(Waters, Sugar-Pak or IC-Pak columns) . Production of 
ethanol (grams of ethanol per grams of cell dry weight) 
is shown in Table 8. 

20 Table 8. Effect of expression of T. reesei (3 subunit, 
on fluxes of ethanol and glucose in s. cerevisiae 



Strain 


P 








J«toh 




h" 1 


g/h/ gdw 


g/h/gdw 


relative 
to con- 
trol 


relative to 
control 


pATP(J34 


0.40 


2.811 


1 .190 


107.7 


105.6 


PATPP4 4 


0.40 


2.750 


1.187 


105.3 


105.3 


pVECTOR 
control 


0.39 


2 .611 


1.127 


100 


100 



These data show that the presence of the T. reese.i F,- 
25 ATPase p subunit resulted in an increased flux of glu- 
cose, as well as ethanol, in the Saccharomyces cerevisiae 
host cells. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Peter Ruhdal JENSEN 

(B) STREET: Soegaards ve j 19 

(C) CITY: Gentofte 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP):. DK-2820 

(ii) TITLE OF INVENTION: A method of improving the production of 
biomass or a desired product from a cell 

Uii) NUMBER OF SEQUENCES: 17 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy, disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release ttl.O, Version 1*1.30 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 963/96 

(B) FILING DATE: 06-SEP-1996 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481S base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis subsp . cremoris 

(B) STRAIN: MG1363 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 26. . 550. 

(D) OTHER INFORMATION: /codon_start = 26 
• /product^ "ATPase subunit". 
/ge.ne= "atpH" 

/ st 2ndard__name= "delta subunit of the Fl portion 
of trie F0F1 ATPase" . 
/label= del ta-subunit 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION : 742 2241 

(D) OTHER INFORMATION: /codon_s tart= 742 
/product^ "ATPase subunit" 
/gene= "atpA" 

/standard_name= "alpha subunit of the Fl portion 
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of the F0F1 ATPase" 
/label= alpha-subuni t 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

<B) LOCATION : 22 60 . .3126 

(D) OTHER INFORMATION: /codon_start= 2260 
' /product= "ATPase subunit" 

/gene= H atpG ,f 

/s tandard_name= "gamma subunit of the Fl portion 
of the F0F1 ATPase" 
/label= gamma- subunit 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 3301 .. 4707 

<D) OTHER INFORMATION: /codon_start= 3301 
/product= "ATPase subunit" 
/gene= "atpD" 

/s tanda rd_name= "beta subunit of the Fl portion of 
the F0F1 ATPase" 
/label= beta-subunit 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TATCTCGCTA AGTTAGGAGA ATAAG ATG ACA AAA GTA AAT TCA CAA AAA TAC 52 

Met Thr Lys Val Asn Ser Gin Lys Tyr 
1 5 

AGT AAA GCT TTA CTT GAG GTC GCC CGA GAA AAA GGA CAA CTT GAA GCA 100 
Ser Lys Ala Leu Leu Glu Val Ala Arg Glu Lys Gly Gin Leu Glu Ala 
10 15 . 20 25 

ATT CTT ACT GAA GTT AGC GAA ATG ATT CAG CTT TTC AAA GAA AAT AAC 14 8 

lie Leu Thr Glu Val Ser Glu Met lie Gin Leu Phe Lys Glu Asn Asn 
30 3 5 40 

TTA GGT GCT TTT TTA GCA AAT GAA GTT TAT TCA TTC TCT GCT AAA TCT 19 6 

Leu Gly Ala Phe Leu Ala Asn Glu Val Tyr Ser Phe Ser Ala Lys Ser 
45 50 55 

GAA TTG ATT GAT ACT TTG CTT CAA ACT TCA TCA GAA GTG ATG TCA AAT 24 4 

Glu Leu lie Aso Thr Leu Leu Gin Thr Ser Ser Glu Val Met Ser Asn 
60 65 70 

TTC CTG AAT ACT ATT CGT TCT AAT GGA CGT CTA GCT GAC CTC GGA GAA 2 92 

Phe Leu Asn Thr lie Arg Ser Asn Gly Arg Leu Ala Asp Leu Gly Glu 
75 80 85 

ATA CTT GAA GAA ACT AAA AAT GCA GCA GAT GAC ATG TTC AAA ATT GCT 34 0 

lie Leu Glu Glu Thr Lys Asn' Ala Ala Asp Asp Met Phe Lys lie Ala 
90 95 100 . 105 

GAC GTT GAA GTT GTT TCA AGT ATT GCA TTG TCA GAA GCT CAA ATT GAA 38 8 

Asp Val Glu Val Val Ser Ser He Ala Leu Ser Glu Ala Gin lie Glu 
110 115 120 

AAA TTT AAA GCA ATG GCT AAA TCA AAA TTT GAT TTA AAC GAA GTA ACA 4 36 

Lys Phe Lys Ala Met Ala Lys Ser Lys Phe Asp Leu Asn Glu Val Thr 
125 130 135 
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GTA ATT AAT ACA GTC AAT GAA AAA ATT CTC GGA GGA TTC ATT GTG AAC 4 84 

Val lie Asn Thr Val Asn Glu Lys lie Leu Gly Gly Phe lie Val Asn 
140 145 150 

TCT CGT GGA AAA ATT ATT GAC GCC TCA TTA AAA ACA CAA TTG GCT AAA 5 32 

Ser Arg Gly Lys lie He Asp Ala Ser Leu Lys Thr Gin Leu Ala Lys 
,155 160 165 

ATC GCC GCT GAA ATC CTC TAATCAGGAT AGAAAAATTT TCTTCCTTTG 580 
He Ala Ala Glu He Leu 
170 175 

TTAAAAACTT AGTGGAGAAT TTTTCAAACT CAAACTGTTA AACTTTTGAA AACATGCAAA 64 0 

GGTAATTTTA AAACTTGCTT ATTCATGCTC AAAAAGTATA ACTGCAGTTT AAAGCTAAAT 7 00 

AGCCTTGAAC TAGTAAAAAA TTTCTAGAAG G GAG CAT ATT T TTG GCA ATT AAA 7 53 

Leu Ala He Lys 

1- 

GCT AAT GAA ATC AGC TCA CTG ATT AAA AAA CAA ATT GAA AAT TTC ACA 801 
Ala Asn Glu lie Ser Ser Leu lie Lys Lys Gin He Glu Asn Phe Thr 
5 10 15 20 

CCA GAT TTT GAA GTT GCT GAA ACT GGT GTC GTT ACC TAT GTT GGT GAT 84 9 

Pro Asp Phe Glu Val Ala Glu Thr Gly Val Val Thr Tyr Val Gly Asp 
25 30 35 

GGT ATC GCG CGT GCC TAT GGC CTT GAA AAT GCG ATG AGC GGT GAG CTT 8 97 

Gly He Ala Arg Ala Tyr Gly Leu Glu Asn Ala Met Ser Gly Giu Leu 
40 45 50 

GTT GAG TTT TCA AAT GGT ATA CTT GGT ATG GCG CAA AAC TTG GAT GCT 94 5 

Val Glu Phe Ser Asn Gly He Leu Gly Met Ala Gin Asn Leu Asp Ala 
55 60 65 

ACA GAC GTT GGT ATT ATC GTA CTT GGT GAT TTC CTC TCA ATT CGT GAA 9 93 

Thr Asp Val Gly He He Val Leu Gly Asp Phe Leu Ser He Arg Glu 
70 75 80 

GGT GAC ACT GTT AAA CGT ACA GGT AAA ATC ATG GAA ATC CAA GTT GGT 1041 
Gly Asp Thr Val Lys Arg Thr Gly Lys He Met Glu lie Gin Val Gly 
85 90 95 100 

GAA GAA CTC ATC GGA CGT GTT GTA AAC CCA CTT GGA CAA CCC GTT GAT 108 9 

Glu Glu Leu He Gly Arg Val Val Asn Pro Leu Gly Gin Pro Val Asp 
105 110 115 

GGA CTT GGA GAA CTT AAT ACA GGT AAA ACT CGT CCA GTT GAA GCA AAA 1137 
Gly Leu Gly Glu Leu Asn Thr Gly Lys Thr Arg Pro Val Glu Ala Lys 
. 120 125 130 

GCT CCT GGT GTT ATG CAA CGT AAA TCA GTC TCT GAG CCA TTA CAA ACT 118 5 

Ala Pro Gly Val Met Gin Arg Lys Ser Val Ser Glu Pro Leu Gin Thr 
135 140 145 



GGT CTT AAA GCG ATT GAT GCC CTC GTT CCA ATT GGA CGT GGA CAA CGT 
Gly Leu Lys Ala He Asp Ala Leu Val Pro He Gly Arg Gly Gin Arg 
150 155 160 
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GAA TTA ATT ATC GGA GAC CGT CAA ACT GGT AAA ACT TCA GTC GCT ATT 12 8 1 

Glu Leu lie He Gly Asp Arg Gin Thr Gly Lys Thr Ser Val Ala He 

165 . 170 175 180 

GAT GCA ATC TTG AAC CAA AAA GGT CAA GAT ATG ATT TGT ATC TAT GTT 132 9 

Asp Ala He Leu Asn Gin Lys Gly Gin Asp Met He Cys He Tyr Val 

185 190 195 

GCG ATT GGA CAA AAA GAG TCA ACA GTT CGT ACA CAA GTT GAA ACG CTC 1377 

Ala lie Gly Gin Lys Glu Ser Thr Val Arg Thr Gin Val Glu Thr Leu 

200 205 210 

CGT AAA CTC GGT GCG ATG GAT TAT ACA ATC GTC GTA ACT GCG TCA GCT 142 5 

Arg Lys Leu Gly Ala Met Asp Tyr Thr He Val Val Thr Ala Ser Ala 
215 220 225 

TCT CAA CCT TCT CCA CTC CTT TAC ATC GCT CCT TAC GCT GGA GCT GCA 147 3 

Ser Gin Pro Ser Pro Leu Leu Tyr He Ala Pro Tyr Ala Gly Ala Ala 
230 235 240 

ATG GGT GAA GAA TTT ATG TAT AAC GGT AAA CAT GTC TTG GTT GTT TAT 1521 

Met Gly Glu Glu Phe Met Tyr Asn Gly Lys His Val Leu Val Val Tyr 

245 250 255 260 

GAT GAT TTA TCT AAA CAA GCG GTC GCT TAC CGT GAA CTT TCT CTC TTG ,1569 

Asp Asp Leu Ser Lys Gin Ala Val Ala Tyr Arg Glu Leu Ser Leu Leu 

265 270 275 

CTC CGT CGT CCA CCA GGT CGT GAA GCA TAC CCA GGT GAC GTT TTC TAC -.1617 

Leu Arg Arg Pro Pro Gly Arg Glu Ala Tyr Pro Gly Asp Val Phe Tyr 

280 285 290 

TTG CAC TCA CGT CTT TTG GAA CGT GCT GCT AAA TTG TCT GAT GAT CTT 1665 

Leu His Ser Arg Leu Leu Glu Arg Ala Ala Lys Leu Ser Asp Asp Leu 
295 300 305 

GGT GGT GGA TCA ATG ACG GCT TTG CCA TTC ATT GAA ACA CAA GCA GGT ^17 13 

Gly Gly Gly Ser Met Thr Ala Leu Pro Phe He Glu Thr Gin Ala Gly 
310 315 320 

GAT ATC TCA GCT TAT ATT CCA ACA AAC GTT ATC TCT ATT ACC GAC GGT 1761 

Asp He Ser Ala Tyr He Pro Thr Asn Val He Ser He Thr Asp Gly 

325 330 335 340 

CAA ATT TTC CTT GAA AAT GAC TTG TTC TAT TCA GGT GTA CGT CCT GCC 18 09 

Gin He Phe Leu Glu Asn Asp Leu Phe Tyr Ser Gly Val Arg Pro Ala 

345 350 355 

ATT GAT GCT GGT TCA TCA GTA TCA CGT GTT GGT GGT GCC GCA. CAA ATC 18 57 

He Asp Ala Gly Ser Ser Val' Ser Arg Val Gly Gly Ala Ala* Gin lie 

360 365 370 

AAA GCC ATG AAG AAA GTA GCT GGT ACT TTG CGT CTT GAC CTT GCG TCG 1905 

Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu Asp Leu Ala Ser 
375 380 385 

TTC CGT GAA CTT GAA GCC TTT ACA CAA TTT GGT TCT GAC CTT GAT GAA 19 53 

Phe Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser Asp Leu Asp Glu 
390 395 400 

GCG ACT CAA GCA AAA TTG AAT CGT GGT CGT CGT ACC GTT GAA GTC TTG 2 001 

Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr Val Glu Val Leu 

405 410 .415 420 
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AAA CAA CCA TTG CAC AAA CCA TTG GCT GTT GAA AAA CAA GTT TTG ATT 204? 
Lys Gin Pro Leu His Lys Pro Leu Ala Val Glu Lys Gin Val Leu He 
425 430 435 

CTC TAT GCA TTG ACT CAT GGT CAT CTT GAT AAT GTT CCA GTT GAT GAT 2 097 

Leu Tyr Ala Leu Thr His Gly His Leu Asp Asn Val Pro Val Asp Asp 
, 440 445 450 

GTT CTT GAT TTT GAA ACT AAA ATG TTC GAT TTC TTC GAT GCA AAT TAT 214 5 

Val Leu Asp Phe Glu Thr Lys Met Phe Asp Phe Phe Asp Ala Asn Tyr 
455 460 465 

GCA GAT CTC TTG AAC GTA ATT ACT GAC ACT AAA GAT TTG CCA GAA GAA 219 3 

Ala Asp Leu Leu Asn Val He Thr Asp Thr Lys Asp Leu Pro Glu Glu 
470 475 480 

GCA AAA CTT GAC GAA GCA ATT AAA GCA TTC AAA AAT ACA ACG AAT TAT 22 41 

Ala Lys Leu Asp Glu Ala He Lys Ala Phe Lys Asn Thr Thr Asn Tyr 
485 490 495 500 

TAATAAGGAG GCTAACTA ATG GGA GCT TCA CTT AAC GAA ATA AAA ACT AAG 22 92 

Met Gly Ala Ser Leu Asn Glu He Lys Thr Lys 
1 5 10 

ATT GCG TCA ACA AAG AAA ACA AGT CAA ATC ACA GGT GCC ATG CAA ATG 234 0 

He Ala Ser Thr Lys Lys Thr Ser Gin He Thr Gly Ala Met Gin Met 
15 20 25 

GTT TCT GCT GCT AAA CTT CAA AAA GCA GAA TCT CAC GCT AAA GCT TTT 2 38 6 

Val Ser Ala Ala Lys Leu Gin Lys Ala Glu Ser Kis Ala Lys Ala Phe 
30 35 4 0 

CAG ACT TAT GCT GAA AAA GTA CGT AAG ATT ACG ACT GAC TTA GTT TCA 24 36 

Gin Thr Tyr Ala Glu Lys Val Arg Lys lie Thr Thr Asp Leu Val Ser 
45 50 55 

AGC GAT AAT GAG CCG GCC AAA AAT CCG ATG ATG ATT AAA CGT GAA GTC 24 8 4 

Ser Asp Asn Glu Pro Ala Lys Asn Pro Met Met He Lys Arg Glu Val 
60 65 70 75 

AAG AAA ACT GGC TAT CTC GTT ATC ACA TCA GAT CGT GGG CTT GTT GGC 25 32 

Lys Lys Thr Gly Tyr Leu Val He Thr Ser Asp Arg Gly Leu Val Gly 
80 85 90 

AGT TAT AAT TCA AAT ATT TTG AAG TCT GTT ATA AGT AAT ATA CGT AAA 258 0 

Ser Tyr Asn Ser Asn He Leu Lys Ser Val He Ser Asn He Arg Lys 
95 100 105 

CGC CAC ACA AAT GAG AGT GAG TAT ACA ATA CTT GCC CTT GGT GGT ACG 2628 
Arg His Thr Asn Glu Ser Glu Tyr Thr He Leu Ala Leu Gly Gly Thr 
110 115 120 

GGA GCG GAC TTT TTC AAA GCC CGT AAC GTC AAA GTT TCT TAT GTT CTT 267 6 

Gly Ala Asp Phe Phe Lys Ala Arg Asn Val Lys Val Ser Tyr Val Leu 
125 130 135 

CGC GGA CTT TCA GAT CAA CCG ACC TTT GAA GAG GTT CGG GCA ATT GTT 27 24 

Arg Gly Leu Ser Asp Gin Pro Thr Phe Glu Glu Val Arg Ala He Val 
140 145 150 155 

ACA GAA GCC GTA GAA GAA TAT CAA GCA GAA GAA TTC GAT GAA CTC TAT 2 7 72 

Thr Glu Ala Val Glu Glu Tyr Gin Ala Glu Glu Phe Asp Glu Leu Tyr 
160 165 170 
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GTT 


TGT 


TAC 


AAC 


CAC 


CAT 


GTG 


AAC 


TCA 


TTG 


GTA 


AGT 


GAG 


GCA 


CGG 


ATG 


2 820 


Val 


Cys 


Tyr 


Asn 


His 


His 


Val 


Asn 


Ser 


Leu 


Val 


Ser 


Glu 


Ala 


Arg 


Met 










175 










180 










18 5 








GAA 


AAA 


ATG 


TTA 


CCT 


ATT 


TCT 


TTT 


GAT 


GAA 


AAA 


GGT 


GAC 


GAA 


AAA 


GCA 


c. O O O 


Glu 


Lys 


Met 


Leu 


Pro 


He 


Ser 


Phe 


Asp 


Glu 


Lys 


Gly 


Asp 


Glu 


Lys 


Ala 








190 










195 










200 










TCT 


CTT 


GTT 


ACA 


TTT 


GAA 


TTA 


GAA 


CCA 


GAT 


CGT 


GAA 


ACA 


ATC 


TTA 


AAT 


2 916 


Ser 


Leu 


Val 


Thr 


Phe 


Giu 


Leu 


Glu 


Pro 


Asp 


Arg 


Glu 


Thr 


lie 


Leu 


Asn 






205 










210 










215 












CAG 


TTG 


TTG 


CCG 


CAA 


TAT 


GCT 


GAA 


AGT 


ATG 


ATT 


TAT 


GGC 


TCA 


ATT 


GTT 


£ 5 OH 


Gin 


Leu 


Leu 


Pro 


Gin 


Tyr 


Ala 


Glu 


Ser 


Met 


He 


Tyr 


Gly 


Ser 


He 


Val 




220 










225 










230 










235 




GAT 


GCA 


AAA 


ACA 


GCA 


GAA 


CAT 


GCT 


GCA 


GGT 


ATG 


ACC 


GCA 


ATG 


CGT 


ACT 


3012 


Asp 


Ala 


Lys 


Thr 


Ala 


Glu 


His 


Ala 


Ala 


Gly 


Met 


Thr 


Ala 


Met 


Arg 


Thr 












240 










245 










250 






GCA 


ACA 


GAT 


AAT 


GCA 


CAT 


TCT 


GTC 


ATT 


AAT 


GAT 


TTA 


ACC 


ATT- 


CAA 


TAT 


3060 


Ala 


Thr 


Asp 


Asn 


Ala 


His 


Ser 


Val 


He 


Asn 


Asp 


Leu 


Thr 


Ile 


Gin 


Tyr 










255 










260 










265 






AAC 


CGT 


GCT 


CGT 


CAA 


GCT 


TCA 


ATT 


ACG 


CAA 


GAA 


ATT 


ACG 


GAA 


ATT 


GTT 


: 3108 


Asn 


Arg 


Ala 


Airg 


Gin 


Ala 


Ser 


He 


Thr 


Gin 


Glu 


He 


Thr 


Glu 


He 


Val 





270 275 280 

GCG GGT GCT TCA GCG CTA TAATTACTGT CAAACATTAT TCTCAATGTT - 3156 

Ala Gly Ala Ser Ala Leu 
285 

ACGATTTATC AACTTGAGGA ATAAATGTTC TGTCAGTAAA GGCTTTGAAT TTTAAATACG > 3216 

TTTGTCAGTA AATTTTTACT GATTAGCTTA AAAAT GAAT A GAAATTCTGT TGTTAGACAG 32 7 6 

AAAATAAAAA C AG GAG G AAA AACA TTG AGT TCT GGT AAA ATT ACT CAG GTT ^-3 327 

Leu Ser Ser Gly Lys He Thr Gin Val 
1 5 

ATC GGT CCC GTC GTT GAC GTG GAA TTT GGT TCT GAT GCC AAA CTG CCT 3 375 

He Gly Pro Val Val Asp Val Glu Phe Gly Ser Asp Ala Lys Leu Pro* 
10 15 20 25 

GAG ATT AAC AAT GCC TTG ATT GTC TAC AAA GAT GTC AAT GGT TTA AAA 3423 
Glu He Asn Asn Ala Leu He Val Tyr Lys Asp Val Asn Gly Leu Lys 
30 35 40 

ACA AAA ATT ACT CTT GAA GTT GCT TTG GAA CTT GGT GAT GGT GCA GTT 34 71 

Thr Lys He Thr Leu Glu Val Ala Leu Glu Leu Gly Asp Gly Ala Val 
45 50 55 

CGT ACG ATC GCT ATG GAA TCT ACT GAT GGA TTG ACT CGT GGA CTT GAA 3519 
Arg Thr lie Ala Met Glu Ser Thr Asp Gly Leu Thr Arg Gly Leu Glu 
60 65 70 

GTC CTT GAT ACA GGT AAA GCG GTC AGC GTT CCT GTT GGT GAA TCT ACT 3 567 

Val Leu Asp Thr Gly Lys Ala Val Ser Val Pro Val Gly Glu Ser Thr 
75 80 < 85 

CTT GGT CGT GTT TTT AAT GTC CTT GGT GAC GTT ATT GAT GGT GGA GAA 3615 
Leu Gly Arg Val" Phe Asn Val Leu Gly Asp Val He Asp Gly Gly Glu 
90 95 100 105 
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GAT TTC CCT GCT GAT GCA GAA CGT AAT CCT ATC CAC AAG AAA GCT CCA 3663 
Asp Phe Pro Ala Asp Ala Glu Arg Asn Pro He His Lys Lys Aia Pro 
110 115 .120 

ACT TTT GAC GAA TTG TCA ACT GCA AAT GAA GTT CTT GTA ACA GGG ATT 3711 
Thr Phe Asp Glu Leu Ser Thr Ala Asn Glu Val Leu Val Thr Gly He 
t 125 130 135 

AAA GTT GTC GAT TTA CTT GCC CCT TAT CTT AAA GGT GGG AAA GTC GGA 37 59 

Lys Val Val Asp Leu Leu Ala Pro Tyr Leu Lys Gly Gly Lys Val Gly 
140 145 ^ 150 

CTC TTC GGT GGT GCC GGT GTT GGT AAA ACC GTC CTT ATC CAA GAA TTG 38 07 

Leu Phe Gly Gly Ala Gly Val Gly Lys Thr Val Leu He Gin Glu Leu 
155 160 165 

ATT CAC AAT ATT GCC CAA GAA CAC GGT GGT ATT TCT GTA TTT ACA GGT 38 55 

He His Asn He Ala Gin Glu His Gly Gly lie Ser Val Phe Thr Gly 
170 175 180 185 

GTT GGC GAT CGT ACT CGT GAC GGG AAT GAC CTT TAC TGG GAA ATG AAA 3903 
Val Gly Asp Arg Thr Arg Asp Gly Asn Asp Leu Tyr Trp Glu Met Lys 
190 195 200 

GAA TCA GGC GTT ATT GAA AAA ACA GCC ATG GTC TTT GGT CAA ATG AAT 3951 
Glu Ser Gly Val He Glu Lys Thr Ala Met Val Phe Gly Gin Met Asn 
205 210 215 

GAA CCA CCT GGA GCA CGT ATG CGT GTT GCC CTT ACT GGT TTA ACA ATT 3 9 99 

Glu Pro Pro Gly Ala Arg Met Arg Val Ala Leu Thr Gly Leu Thr He 
220 225 230 

GCG GAA TAT TTC CGT GAT GTT CAA GGA CAA GAC GTA TTG CTT TTC ATC 4 04 7 

Ala Glu Tyr Phe Arg Asp Val Gin Gly Gin Asp Val Leu Leu Phe He 
235 240 245 

GAT AAC ATC TTC CGT TTC ACT CAA GCT GGT TCA GAA GTT TCT GCC CTT 4 09 5 

Asp Asn He Phe Arg Phe Thr Gin Ala .Gly Ser Glu Val Ser Ala Leu 
250 255 260 265 

TGG GGA CGT ATG CCT TCT GCC GTT GGT TAC CAA CCA ACT CTT GCA ACT 414 3 

Trp Gly Arg Met Pro Ser Ala Val Gly Tyr Gin Pro Thr Leu Ala Thr 
270 275 280 

GAA ATG GTT CAA TTA CAG GAA CGT ATC ACT TCT ACT AAG AAG GGT TCT ,4191 
Glu Met Val Gin Leu Gin Glu Arg He Thr Ser Thr Lys Lys Gly Ser 
285 290 295 

GTT ACA TCT ATC CCA' GCG ATT TAT GTC CCT GCC GAT GAC TAT ACT GAC 4239 
Val Thr Ser He Pro Ala lie Tyr Val Pro Ala Asp Asp Tyr Thr Asp 
300 - -305 310 

CCA GCG CCA GCT ACA GCC TTC GCT CAC TTG GAC GCA ACA ACT AAC TTG 4 28 7 

Pro Ala Pro Ala Thr Ala Phe Ala His Leu Asp Ala Thr Thr Asn Leu 
315 320 325 

GAA CGT CGT TTG ACA CAA ATG GGT ATC TAT CCA GCC GTT GAC CCA CTT 4 335 

Glu Arg Arg Leu Thr Gin Met Gly He Tyr Pro Ala Val Asp Pro Leu 
330 335 . • 340 345 

GCT TCA TCA TCA CGT GCG CTT ACA CCT GAA ATT GTT GGT GAA GAA CAC 4 38 3 

Ala Ser Ser Ser Arg Ala Leu Thr Pro Glu lie Val Gly Glu Glu His 
350 355 360 
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He Ala Leu Ser Glu Ala Gin He Glu Lys Phe Lys Ala Met Ala Lys 

115 120 . 125 

Ser Lys Phe Asp Leu Asn Glu Val Thr Val He Asn Thr Val Asn Glu 

130 135 140 

Lys He Leu Gly Gly Phe He Val Asn Ser Arg Gly Lys He He Asp 
145 ' . 150 155 160 

Ala Ser Leu Lys Thr Gin Leu Ala Lys He Ala Ala Glu He Leu 
165 170 - 175 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Leu Ala He Lys Ala Asn Glu He Ser Ser Leu He Lys Lys Gin He 
1 5 10 15 

Glu Asn Phe Thr Pro Asp Phe Glu Val Ala Glu Thr Gly Val Val Thr 
20 25 30 

Tyr Val Gly Asp Gly He Ala Arg Ala Tyr Gly Leu Glu Asn Ala Met 
35 40 45 

Ser Gly Glu Leu Val Glu Phe Ser Asn Gly He Leu Gly Met Ala Gin 
50 55 60 

Asn Leu Asp Ala Thr Asp Val Gly He He Val Leu Gly Asp Phe Leu 
65 70 75 80 

Ser He Arg Glu Gly Asp Thr Val Lys -Arg Thr Gly Lys lie Met Glu 
85 90. 95 

lie Gin Val Gly Glu Glu Leu lie Gly Arg Val Val Asn Pro Leu Gly 
100 105 110 

Gin Pro Val Asp Gly Leu Gly Glu Leu Asn Thr Gly Lys Thr Arg Pro 
115 120 125 

Val Glu Ala Lys Ala Pro Gly Val Met Gin Arg Lys Ser Val Ser Glu 
130 135 140 

Pro Leu Gin Thr Gly Leu Lys Ala lie Asp Ala Leu Val Pro He Gly 
145 150 155 160 

Arg Gly Gin Arg Glu Leu He He Gly Asp Arg Gin The Gly Lys Thr 
165 170 175 

Ser Val Ala He Asp Ala He Leu Asn Gin Lys Gly Gin Asp Met He 
180 . 185 190 



Cys He Tyr Val Ala He Gly Gin Lys Glu Ser Thr Val Arg Thr Gin 
195 200 205 
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TAT GAA GTT GCA ATG GAA GTT CAA CGT GTC CTT CAA CGC TAC AAA GAA 4431 
Tyr Glu Val Ala Met Glu Val Gin Arg Val Leu Gin Arg Tyr Lys Glu 
365 370 375 

TTG CAA GAT ATC ATT GCC ATT CTT GGT ATG GAT GAA TTG TCA GAT GAT 4 4 79 

Leu Gin Asp lie lie Ala lie Leu Gly Met Asp Glu Leu Ser Asp Asp 
380 385 390 

GAA AAA ATT CTC GTT GGA CGT GCA CGT CGT ATC CAA TTC TTC CTT TCA 4 527 

Glu Lys lie Leu Val Gly Arg Ala Arg Arg lie Gin Phe Phe Leu Ser 
395 400 405 

CAA AAC TTC CAC GTT GCT GAA CAG TTT ACT GGT CAA CCT GGT TCA TAT 4 57 5 

Gin Asn Phe His Val Ala Glu Gin Phe Thr Gly Gin Pro Gly Ser Tyr 
410 415 420 425 

GTA CCA ATT GAC AAA ACA GTT CAT GAC TTC AAG GAA ATT TTG GAA GGT 4 673 

Val Pro lie Asp Lys Thr Val His Asp Phe Lys Glu lie Leu Glu Gly 
430 435 440 

AAA TAT GAC GAA GTC CCT GAA GAT GCT TTC CGT GGA GTA GGT CCA ATT 4 671 

Lys Tyr Asp Glu Val Pro Glu Asp Ala Phe Arg Gly Val Gly Pro He 
445 450 455 

GAA GAC GTA CTT GCA AAA GCA AAA TCA ATG GGT TAT TAATTCGATT 4 717 

Glu Asp Val Leu Ala Lys Ala Lys Ser Met Gly Tyr 
460 465 

TCTTATGAAA TGACAAAGTG AAAATACATT ATTGAATCGC AAAATTTACT GACAATAATT 4 777 

CTGTCGTAAG TGCTCACTTT TAAGTTGTTC CGATCGTT 4 815 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Lys Val Asn Ser Gin Lys Tyr Ser Lys Ala Leu Leu Glu Val 
1 5 10 15 

Ala Arg Glu Lys Gly Gin Leu Glu Ala He Leu Thr Glu Val Ser Glu 
20 25 30 

Met He Gin Leu Phe Lys Glu Asn Asn Leu Gly Ala Phe Leu Ala Asn 
35 40 45 

Glu Val Tyr Ser Phe Ser- Ala Lys Ser Glu Leu He Asp Thr Leu Leu 
50 55 60 

Gin Thr Ser Ser Glu Val Met Ser Asn Phe Leu Asn Thr He Arg Sec 
65 70 . 75 80 

Asn Gly Arg Leu Ala Asp Leu Gly Glu He Leu Glu Glu Thr Lys Asn 
85 90 95 

Ala Ala Asp Asp Met Phe Lys He Ala Asp Val Glu Val Val Ser Ser 
100 105 110 
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Val Glu Thr 
210 

Thr Ala Ser 
225 



Leu Arg Lys Leu Gly Ala Met Asp Tyr Thr lie Val Val 
215 220 

Ala Ser Gin Pro Ser Pro Leu Leu Tyr lie Ala Pro Tyr 

230 235 240 



Ala Gly Ala 
Leu Val Val 



Ala Met Gly Glu Glu Phe Met Tyr Asn Gly Lys His Val 
245 250 255 

Tyr Asp Asp Leu Ser Lys Gin Ala Val Ala Tyr Arg Glu 

260 265 270 



Leu Ser Leu Leu Leu Arg Arg Pro Pro Gly Arg Glu Ala Tyr Pro Gly 
275 280 285 



Asp Val Phe 
290 

Ser Asp Asp 
305 



Tyr Leu His Ser Arg Leu Leu Glu Arg Ala Ala Lys Leu 

295 300 

Leu Gly Gly Gly Ser Met Thr Ala Leu Pro Phe lie Glu 

310 315 320 



Thr Gin Ala 



Gly Asp lie Ser Ala Tyr lie Pro Thr Asn Val lie Ser 
325 330 335 



lie Thr Asp 



Gly Gin lie Phe Leu Glu Asn Asp Leu Phe Tyr Ser Gly 
340 345 350 



Val Arg Pro 
355 



Ala lie Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly 
360 365 



Ala Ala Gin 
370 



lie Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu 
375 380 



Asp Leu Ala 
385 



Ser Phe Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser 
390 395 400 



Asp Leu Asp 



Glu Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg r Arg Thr 
405 410 415 



Val Glu Val 



Leu Lys Gin Pro Leu His Lys Pro Leu Ala Val Giu Lys 
420 425 430 



Gin Val Leu lie Leu Tyr Ala Leu Thr His Gly His Leu Asp Asn Val 
435 440 445 



Pro Val Asp 
450 

Asp Ala Asn 
465 



Asp Val Leu Asp Phe Glu Thr Lys Met Phe Asp Phe Phe 
455 460 

Tyr- Ala Asp Leu Leu Asn Val lie Thr Asp Thr Lys Asp 

470 475 480 



Leu Pro Glu 



Glu Ala Lys Leu Asp Glu Ala lie Lys Ala Phe Lys Asn 
485 490 495 



Thr Thr Asn 



Tyr 
500 



(2) INFOPMATIpN FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Ala Ser Leu Asn Glu He Lys Thr Lys He Ala Ser Thr Lvs 
1, 5 10 15 

Lys Thr Ser Gin He Thr Gly Ala Met Gin Met.Val Ser Ala Ala Lvs 
20 25 30 

Leu Gin Lys Ala Glu Ser His Ala Lys Ala Phe Gin Thr Tyr Ala Glu 
35 40 45 

Lys Val Arg Lys He Thr Thr Asp Leu Val Ser Ser Asp Asn Glu Pro 
50 55 60 

Ala Lys Asn Pro Met Met He Lys Arg Glu Val Lys Lys Th^ Gi y Tyr 
65 70 75 SO 

Leu Val He Thr Ser Asp Arg Gly Leu Val Gly Ser Tyr Asn Ser Asn 
85 90 9$ 

He Leu Lys Ser Val He Ser Asn He Arg Lys Arg His Thr Asn G 1 u 
100 105 no 

Ser Glu Tyr Thr He Leu Ala Leu Gly Gly Thr Gly Ala Asp Ph^ Phe 
115 120 125 

Lys Ala Arg Asn Val Lys Val Ser Tyr Val Leu Arg Gly Leu Ser Asp 
130 135 KO 

Gin Pro Thr Phe Glu Glu Val Arg Ala He Val Thr Glu Ala Val Glu 
145 150- 155 160 

Glu Tyr Gin Ala Glu Glu Phe Asp Glu Leu Tyr Val Cys Tyr Asn H^ s 
165 170 175 

His Val Asn Ser Leu Val Ser Glu Ala Arg Met Glu Lys Met Leu Pro 
180 185 190 

He Ser Phe Asp Glu Lys Gly Asp Glu Lys Ala Ser Leu Val The Phe 
195 200 205 

Glu Leu Glu Pro Asp Arg Glu Thr He Leu Asn Gin Leu Leu Pro Gin 
210 215 220 

Tyr Ala Glu Ser Met He Tyr Gly Ser lie Val Asp Ala Lys Thr Ala 
225 230 235 240 

Glu His Ala Ala Gly Met Thr Ala Met Arg Thr Ala Thr Asp Asn Ala 
245 250 255 

His Ser Val He Asn Asp Leu Thr He Gin Tyr Asn Arg Ala Arg Gin 
260 265 270 

Ala Ser He Thr Gin Glu He Thr Glu He Val Ala Gly Ala Ser Ala 
275 280 285 

Leu 
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(2) INFORMATION FOR SEQ ID NO: 5: ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Leu Ser Ser Gly Lys lie Thr Gin Val lie Gly Pro Val Val Asp Val 
1 5 10 15 

Glu Phe Gly Ser Asp Ala Lys Leu Pro Glu lie Asn Asn Ala Leu lie 
20 25 30 

Val Tyr Lys Asp Val Asn Gly Leu Lys Thr Lys lie Thr Leu Glu Val 
35 40 45 

Ala Leu Glu Leu Gly Asp Gly Ala Val Arg Thr He Ala Met Glu Ser 
-50 55 60 

Thr Asp Gly Leu Thr Arg Gly Leu Glu Val Leu Asp Thr Gly Lys Ala 
65 70 75 80 

Val Ser Val Pro Val Gly Glu Ser Thr Leu Gly Arg Val Phe Asn Val 
85 90 95 

Leu Gly Asp Val He Asp Gly Gly Glu Asp Phe Pro Ala Asp Ala Glu 
100 105 110 

Arg Asn Pro He His Lys Lys Ala Pro Thr Phe Asp Glu Leu Ser Thr 
115 120 125 

Ala Asn Glu Val Leu Val Thr Gly He Lys Val Val Asp Leu Leu Ala 
130 135 140 

Pro Tyr Leu Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val 
145 ISO 155 160 

Gly Lys Thr Val Leu lie Gin Glu Leu He His Asn He Ala Gin Glu 
165 170 175 

His Gly Gly He Ser Val Phe Thr Gly Val Gly Asp Arg Thr Arg Asp 
180 185 190 

Gly Asn Asp Leu Tyr Trp Glu Met Lys Glu Ser Gly Val He Glu Lys 
195 200 205 

Thr Ala Met Val Phe Gly Gin Met Asn Glu Pro Pro Gly Ala Arg Met 
210 215 220 

Arg Val Ala Leu Thr Gly Leu Thr He Ala Glu Tyr Phe Arg Asp Val 
225 230 235 240 

Gin Gly Gin Asp Val Leu Leu Phe He Asp Asn He Phe Arg Phe Thr 
245 250 255 

Gin Ala Gly Ser Glu Val Ser Ala Leu Trp Gly Arg Met Pro Ser Ala 
260 265 270 

Val Gly Tyr Gin Pro Thr Leu Ala Thr Glu Met Val Gin Leu Gin Glu 
275 280 285 
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Arg lie Thr Ser Thr Lys 
290 

Tyr Val Pro Ala Asp Asp 

305 310 

Ala ^His Leu Asp Ala Thr 
325 

Gly He Tyr Pro Ala Val 
340 



Lys Gly Ser Val Thr 
295 

Tyr Thr Asp Pro Ala 
315 

Thr Asn Leu Glu Arg 
330 

Asp Pro Leu Ala Ser 
345 



Ser He Pro Ala He 
300 

Pro Ala Thr Ala Phe 
320 

Arg Leu Thr Gin Met 
335 

Ser Ser Arg Ala Leu 
350 



Thr Pro Glu He Val Gly Glu Glu His Tyr Glu Val Ala Met Glu Val 
355 360 365 

Gin Arg Val Leu Gin Arg Tyr Lys Glu Leu Gin Asp He He Ala He 
370 375 380 

Leu Gly Met Asp Glu Leu Ser Asp Asp Glu Lys He Leu Val Gly Arg 
385 390 395 400 

Ala Arg Arg He Gin Phe Phe Leu Ser Gin Asn Phe His Val Ala Glu 
405 410 415 

Gin Phe Thr Gly Gin Pro Gly Ser Tyr Val Pro He Asp Lys Thr Val 
420 425 430 

His Asp Phe Lys Glu He Leu Glu Gly Lys Tyr Asp Glu Val Pro Glu 
435 440 445 

Asp Ala Phe Arg Gly Val Gly Pro He Glu Asp Val Leu Ala Lys Ala 
450 455 460 



Lys Ser Met Gly Tyr 
465 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis subsp. lactis 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATIONS . . 633 

(D) OTHER INFORMATION: /partial 
/codon_start= 4 

/product= "ATPase subunit, partial sequence" 
/gene= "atpA" 

/standard_name= "alpha subunit of the Fl portion 
of the F0F1 ATPase" 
/label= aipha-subuni t 
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(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 652. . 1518 

(D) OTHER INFORMATION: / codon_s ta r t= 652 
/product= "ATPase subunit" 
/gene= "atpG M 

/s tandard_name= "gamma subunit of the Fl portion 
of the F0F1 ATPase" 
/label= gamma- subunit 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1654 . .2205 

(D) OTHER INFORMATION: /partial 
/codon_s tart= 1654 

/product^ "ATPase subunit, partial sequence" 
/gene= "atpD" 

/standard_name= "beta subunit of the Fl portion of 
the F0F1 ATPase" 
/label= beta-subunit 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TGA TTC TAC TTA CAT TCA CGT CTT TTG GAA CGT GCT GCC AAA TTA TCT 4 6 

Phe Tyr Leu His Ser Arg Leu Leu Giu Arg Ala Ala Lys Leu Ser 
470 475 480 

GAC TAT CTT GGT GGT GGT TCA ATG ACT GCA CTG CCA TTC ATT GAA ACA 96 

Asp Tyr Leu Gly Gly Gly Ser Met Thr Ala Leu Pro Phe lie Glu Thr 
485 490 495 500 

CAA GCC GGA GAT ATC TCA GCT TAT ATT GCA ACA AAC GTT ATC TCT ATT 144 

Gin Ala Gly Asp lie Ser Ala Tyr lie Ala Thr Asn Val lie Ser He 

505 510 515 

ACT GAC GGT CAA ATT TTC CTT GAA AAT GAC TTA TTC TAT TCA GGT GTA - 192 

Thr Asp Gly Gin He Phe Leu Glu Asn Asp Leu Phe Tyr Ser Gly Val 
520 525 530 

CGT CCT GCC ATC GAT GCT GGT TCT TCA GTT TCT CGG GTT GGT GGT GCT 24 0 

Arg Pro Ala He Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly Ala 
535 540 545 

GCA CAG ATC AAA GCC ATG AAG AAA GTT GCT GGT ACT TTG CGT CTT GAC 2 88 

Ala Gin He Lys. Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu Asp 
550 555 560 

CTT GCG TCA TTC CGT GAA CTT GAA GCC TTT ACT CAA TTT GGT TCT GAT 3 36 

Leu Ala Ser Phe Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser Asp 
565 570 575 580 

CTT GAT GAA GCG ACT CAA GCA AAA TTG AAT CGT GGT CGT CGT ACC GTT 3 84 

Leu Asp Glu Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr Val 

585 590 595 

GAA GTT TTG AAG CAA CCA TTG CAC AAA CCA TTG GCT GTT GAA AAA CAA 4 32 

Glu Val Leu Lys Gin Pro Leu His Lys Pro Leu Ala Val Glu Lys Gin 
600 605 610 

GTT TTA ATT CTT TAT GCA TTG ACT CAT GGT CAC TTG GAT GAT GTT CCA 4 80 

Val Leu He Leu Tyr Ala Leu Thr His Gly His Leu Asp Asp Val Pro 
615 620 625 
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GTT GAT GAC GTC CTT GAT TTT GAA ACA AAC AAT GTC CGA TTC TTC GAT 528 
Val Asp Asp Val Leu Asp Phe Glu Thr Asn Asn Val Arg Phe Phe Asp 
630 635 640 

GCA AAT TAT GCA AAA CTC TTG AAC GTG ATT ACT GAA ACT AAA GAT TGC 576 
Ala Asn Tyr Ala Lys Leu Leu Asn Val lie The Glu Thr Lys Asp Cys 
645 t 650 655 660 

CAG AAG AAG CAA AAC TCG ACG AAG CAA TTA AAG CAT TCT AAA ATA CAA 624 
Gin Lys Lys Gin Asn Ser Thr Lys Gin Leu Lys His Ser Lys lie Gin 
665 670 675 

CGA ATT ATT AAT AAG GAG G CTAATCTA ATG GGA GCT TCA CTT AAT GAA ATA 67 5 

Arg lie lie Met Gly Ala Ser Leu Asn Glu lie 

1 5 

AAA ACT AAG ATT GCC TCA ACG AAG AAA ACA AGT CAA ATA ACT GGA GCC 723 

Lys Thr Lys lie Ala Ser Thr Lys Lys Thr Ser Gin lie Thr Gly Ala 
10 15 20 

ATG CAA ATG GTT TCC GCT GCG AAA CTT CAA AAA GCT GAA TCT CAT GCC 771 
Met Gin Met Val Ser Ala Ala Lys Leu Gin Lys Ala Glu Ser His Ala 
25 30 35 40 

AAA GCA TTT CAA ATT TAT GCT GAA AAA GTT CGT AAA ATT ACA ACT GAT 819 
Lys Ala Phe Gin He Tyr Ala Glu Lys Val Arg Lys He Thr Thr Asp 
45 50 55 

TTA GTT TCC TCT GAC AAA GAG CCA GCT AAG AAT CCA ATG ATG ATA GGA 8 67 

Leu Val Ser Ser Asp Lys Glu Pro Ala Lys Asn Pro Met Met He Gly 
60 65 70 

AGA GAA GTC AAA AAA ACT GGC TAT CTT GTA ATT ACT TCG GAT CGT GGA 915 
Arg Glu Val Lys Lys Thr Gly Tyr Leu Val He Thr Ser Asp Arg Gly 
75 80 85 

CTT GTC GGT GGC TAT AAT TCA TAT ATT TTG AAA TCT GTC ATG AAT ACT 963 
Leu Val Gly Gly Tyr Asn Ser Tyr He Leu Lys Ser Val Met Asn Thr 
90 95 100 

ATC CGT AAA CGT CCT GCT AAT GAA AGT GAA TAT ACT ATT CTT GCA CTT 1011 
He Arg Lys Arg Pro Ala Asn Glu Ser Glu Tyr Thr lie Leu Ala Leu 
105 110 115 120 

GGC GGT ACT GGA GCA GAT TTC TTC GGA GCA AGC AAT GTT AAA AGT TTC 10 59 

Gly Gly Thr Gly Ala Asp Phe Phe Gly Ala Ser Asn Val Lys Ser Phe 
125 130 135 

TTA GTC CTT TGT GGT TTT TCA GAC CAA CCA AAT TTT GAA GAA GTT AGA 1107 
Leu Val Leu Cys Gly Phe Ser Asp Gin Pro Asn Phe Glu Glu Val Arg 
140 145 150 

GCG ATT GTT ACA GAA GCG GTA ACT GAA TAT CAA GCA GAA GAA TTT GAT 1155 
Ala He Val Thr Glu Ala Val Thr Glu Tyr Gin Ala Glu Glu Phe Asp 
155 160 165 

GAA CTT TAT GTT TGC TAT AAT CAC CAT GTG AAC TCA TTG GTA AGT GAA 12 03 

Glu Leu Tyr Val Cys Tyr Asn His His Val Asn Ser Leu Val Ser Glu 
170 175 180 

GCA AGT ATG GAA AAA ATG TTG CCT ATT TTT TTT GAA GCA TCA GGT CAA 1251 
Ala Ser Met Glu Lys Met Leu Pro He Phe Phe Glu Ala Ser Gly Gin 
185 190 195 200 
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CAA AAA CCA TTT TTT GAA ACA TTT GAA TTA GAA CCA GAT TGT GAA ACA 12 99 

Gin Lys Pro Phe Phe Glu Thr Phe Glu Leu Glu Pro Asp Cys Glu Thr 
205 210 . 215 

ATT TTA AAC CAA TTG TTG CCA CCA TAC GCT GAA AGT ATG ATT TAT GGT 134 7 

lie Leu Asn Gin Leu Leu Pro Pro Tyr Ala Glu Ser Met lie Tyr Gly 
220 225 230 

TCA ATC GTT GAT GCT AAG ACA GCA GAA CAT GCT GCA GGT ATG ACA GCA 13 95 

Ser lie Val Asp Ala Lys Thr Ala Glu His Ala Ala Gly Met Thr Ala 
235 240 245 

ATG CGT ACT GCA ACT GAT AAT GCT CAC TCT GTT ATC MT GAT TTG ACT 14 43 

Met Arg Thr Ala Thr Asp Asn Ala His Ser Val lie Asn Asp Leu Thr 
250 255 260 

ATT CAA TAC AAC CGT GCT CGT CAA GCA TCG ATT ACG CAA GAA ATT ACG 14 91 

lie Gin Tyr Asn Arg Ala Arg Gin Ala Ser lie Thr Gin Glu lie Thr . 
265 270 275 280 

GAA ATC GTT GCA GGA GCC TCA GCG CTT TAATTTACTG ATAGGAATTC 15 38 

Glu lie Val Ala Gly Ala Ser Ala Leu 

285 ' 

TGTCAGTGAT GGCTTTGAAT CTTAATTGTT TTTGTCAGTA AAATTTT TAC TGACAAACAT 15 98 

AAAAATGAAT AGAAATTCTG TTCTTTGACA GAAAATAAAA ACAGGAGGAA AAACA TTG 16 56 

Leu 
1 

AGT TCT GGT AAA ATT ACT CAG ATT ATC GGT CCC GTC GTT GAC GTG GAA 17 04 

Ser Ser Gly Lys lie Thr Gin He He Gly Pro Val Val Asp Val Glu 
5 10 15 

TTT GGT TCT GAT GCC AAA TTG CCT GAG ATT AAC AAT GCC TTG ATT GTC * 7 52 
Phe Gly Ser Asp Ala Lys Leu Pro Glu He Asn Asn Ala Leu He Val 
20 2 5 . 30 

TAC AAA GAT GTC AAT GGC CTA AAA ACA AAA ATT ACT CTT GAA GTT GCT 18 00 

Tyr Lys Asp Val Asn Gly Leu Lys Thr Lys He Thr Leu Glu Val Ala 
35 40 45 

TTG GAA CTT GGT GAT GGT GCA GTT CGT ACA ATC GCT ATG GAA TCT ACT 184 8 

Leu Glu Leu Gly Asp Gly Ala Val Arg Thr He Ala Met Glu Ser Thr 
50 55 60 65- 

GAT GGC TTG ACT CGT GGA CTT GAA GTC CTT GAT ACA GGT AAA GCA GTC 18 96 

Asp Gly Leu The Arg Gly Leu Glu Val Leu Asp Thr Gly Lys Ala Val 
70 .75 80 

AGC GTT CCT GTT GGG GAA GCC ACT CTT GGT CGT GTT TTT AAC GTC CTT 194 4 

Ser Val Pro Val Gly Glu Ala Thr Leu Gly Arg Val Phe Asn Val Leu 
85 90 95 

GGT GAT GTT ATT GAC GGT GGG GAA GAA TTT GCT GCT GAT GCA GAA CGT 1992 
Gly Asp Val He Asp Gly Gly Glu Glu Phe Ala Ala Asp Ala Glu Arg 
100 105 110 

AAT CCT ATC CAT AAA AAA GCT CCA ACA TTT GAC GAA TTG TCA ACT GCA 204 0 

Asn Pro He His Lys Lys Ala Pro Thr Phe Asp Glu Leu Ser Thr Ala 
115 120 125 
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AAC GAA GTT CTC GTA ACT GGG ATT AAA GTT GTC GAT TTG CTT GCA CCT 2 08 8 

Asn Glu Val Leu Val The Gly lie Lys Val Val Asp Leu Leu Ala Pro 
130 135 140 145 

TAC CTT AAA GGT GGT AAA GTT GGA CTT TTC GGT GGT GCC GGA GTT GGT 2136 
Tyr Leu Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val Gly 
150 155 160 

AAA GCC GTC CTT ATT CAA GAA TTG AAA CAC AAC ATC GCC CAA GAA CAC 2184 
Lys Ala Val Leu lie Gin Glu Leu Lys His Asn lie Ala Gin Glu His 
165 170 175 

GGA GGT ATT TCT GTG TTT ACC GG 22 07 

Gly Gly He Ser Val Phe Thr 
180 



(2) INFORMATION FOR SEQ ID NO: 7: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Phe Tyr Leu His Ser Arg Leu Leu Glu Arg Ala Ala Lys Leu Ser Asp 
1 5 10 15 

Tyr Leu Giy Gly Gly Ser Met Thr Ala Leu Pro Phe He Glu Thr Gin 
20 25 30 

Ala Gly Asp He Ser Ala Tyr He Ala Thr Asn Val lie Ser He Thr 
35 40 45 

Asp Gly Gin He Phe Leu Glu Asn Asp Leu Phe Tyr Ser Giy Val Arg 
50, 55 60 

Pro Ala He Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly Ala Ala 
65 70 75 80 

Gin He Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu Asp Leu 
85 90 95 

Ala Ser Phe Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser Asp Leu 
100 105 HO 

Asp Glu Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr Val Glu 
115 120 125 

Val Leu Lys Gin Pro Leu His Lys Pro Leu Ala Val Glu Lys Gin Val 
130 135 140 

Leu He Leu Tyr Ala Leu Thr His Gly His Leu Asp Asp Val Pro Val 
145 150 155 160 

Asp Asp Val Leu Asp Phe Glu Thr Asn Asn Val Arg Phe Phe Asp Ala 
165 • 170 . 175 

Asn Tyr Ala Lys Leu Leu Asn Val He Thr Glu Thr Lys Asp Cys Gin 
180 185 190 
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Lys Lys Gin Asn Ser Thr Lys Gin Leu Lys His Ser Lys lie Gin Arg 
195 200 205 

He He 
210 

(2)' INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Gly Ala Ser Leu Asn Glu He Lys Thr Lys He Ala Ser Thr Lys 
15 10 15 

Lys Thr Ser Gin He Thr Gly Ala Met Gin Met Val Ser Ala Ala Lys 
20 25 30 

Leu Gin Lys Ala Glu Ser His Ala Lys Ala Phe Gin He Tyr Ala Glu 
35 40 45 

Lys Val Arg Lys He Thr Thr Asp Leu Val Ser Ser Asp Lys Glu Pro 
50 55 60 

Ala Lys Asn Pro Met Met He Gly Arg Glu Val Lys Lys Thr Gly Tyr 
65 70 75 80 

Leu Val He Thr Ser Asp Arg Gly Leu Val Gly Gly Tyr Asn Ser Tyr 
85 90 95 

He Leu Lys Ser Val Met Asn Thr He Arg Lys Arg Pro Ala Asn Glu 
100 105 110 

Ser Glu Tyr Thr He Leu Ala Leu Gly Gly Thr Gly Ala Asp Phe Phe 
115 120 125 

Gly Ala Ser Asn Val Lys Ser Phe Leu Val Leu Cys Gly Phe Ser Asp 
130 135 140 

Gin Pro Asn Phe Glu Glu Val Arg Ala He Val Thr Glu Ala Val Thr 
145 150 155 160 

Glu Tyr Gin Ala Glu Glu Phe Asp Glu Leu Tyr Val Cys Tyr Asn His 
165 170 175 

His Val Asn . Ser Leu Val Ser Glu Ala Ser Met Glu Lys Met Leu Pro 
180 185 190 

He Phe Phe Glu Ala Ser Gly Gin Gin Lys Pro Phe Phe Glu Thr Phe 
195 200 205 

Glu Leu Glu Pro Asp Cys Glu Thr He Leu Asn Gin Leu Leu Pro Pro 
210 215 220 



Tyr Ala Glu Ser Met He Tyr Gly Ser He Val Asp Ala Lys Thr Ala 
225 230 . 235 240 
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Glu His Ala Ala Gly Met Thr Ala 
245 

His Ser Val lie Asn Asp Leu Thr 
260 

Ala, Ser He Thr Gin Glu He Thr 
275 280 

Leu 



Met Arg Thr Ala Thr Asp Asn Ala 
250 255 

He Gin Tyr Asn Arg Ala Arg Gin 
265 270 

Glu He Val Ala Gly Ala Ser Ala 
285 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 
'(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 9 : 

Leu Ser Ser Gly Lys He Thr Gin He He. Gly Pro Val Val Asp Val 
15 10 15 

Glu Phe Gly Ser Asp Ala Lys Leu Pro Glu He Asn Asn Ala Leu He 
20 25 30 

Val Tyr Lys Asp Val Asn Gly Leu Lys Thr Lys He Thr Leu Glu Val 
35 40 45 

Ala Leu Glu Leu Gly Asp Gly Ala Val Arg Thr He Ala Met Glu Ser 
50 55 60 

Thr Asp Gly Leu Thr Arg Gly Leu Glu Val Leu Asp Thr Gly Lys Ala 
65 70 75 80 

Val Ser Val Pro Val Gly Glu Ala Thr Leu Gly Arg Val Phe Asn Val 
85 90 95 

Leu Gly Asp Val He Asp Gly Gly Glu Glu Phe Ala Ala Asp Ala Glu 
100 105 HO 

Arg Asn Pro He His Lys Lys Ala Pro Thr Phe Asp Glu Leu Ser Thr 
115 120 125 

Ala Asn Glu Val Leu Val Thr Gly He Lys Val Val Asp Leu Leu Ala 
130 135 140 

Pro Tyr Leu Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val 
145 150 ' 155 160 

Gly Lys Ala Val Leu lie Gin Glu Leu Lys His Asn lie Ala Gin Glu 
165 170 175 

His Gly Gly He Ser Val Phe Thr 
18 0 - 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2161 base pairs 



WO 98/10089 




PCT/DK97/00373 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus thermophilus 

(B) STRAIN: ST3 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 
<B) LOCATION : 2 . . 637 
(D) OTHER INFORMATION: /partial 
/codon_start= 2 

/product= "ATPase subuni t, partial sequence" 
/gene= "atpA" 

/standard_name= "alpha subunit of the Fl portion 
of the F0F1 ATPase" 
/label= alpha-subunit 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION : 659 . . 1S37 

(D) OTHER INFORMATION: /codon_start= 659 
/product^ "ATPase subunit" 
/gene= "atpG" 

/standard_name= "gamma subunit of the Fl portion 
of the F0F1 ATPase" 
/labels gamma-subunit 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1616. .2161 

(D) OTHER INFORMATION: /partial 
/codon_start= 1616 

/product= "ATPase subunit, partial sequence" 
/gene= "atpD" 

/standard_name= "beta subunit of the Fl portion of 
the F0F1 ATPase" 
/label= beta-subunit 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

T GAT TCT CAT CTC CAC TCA CGT CTT TTG GAA CGT TCA GCT AAG CTT 
Asp Ser His Leu His Ser Arg Leu Leu Glu Arg Ser Ala Lys Leu 
185 190 195 

TCA GAT GAT . CTT GGT GGT GGT TCA ATG ACT GCC TTG CCA ATC ATC CAA 
Ser Asp Asp Leu Gly Gly 'Gly Ser Met Thr Ala Leu Pro lie lie Gin 
200 205 210 215 

ACA CAA GCA GGA GAT ATC TCA GCT TAT ATC GCG ACA AAC GTT ATT TCT 
Thr Gin Ala Gly Asp lie Ser Ala Tyr He Ala Thr Asn Val He Ser 
220 * 225 230 
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ATC ACA GAT GGA CAA ATC TTC TTG CAA GAA AAT CTT TTC AAC TCA GGT 190 
lie Thr Asp Gly Gin He Phe Leu Gin Glu Asn Leu Phe Asn Ser Gly 
235 240 245 

ATT CGT CCT GCG ATT GAT GCT GGT TCT TCA GTA TCA CGT GTT GGT GGT 238 
He Arg Pro Ala He Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly 
250 255 260 

TCA GCA CAA ATC AAA GCA ATG AAG AAA GTT GCT GGT ACC CTT CGT CTT 286 
Ser Ala Gin He Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu 
265 270 275 

GAC TTG GCT TCT CAC CGT GAA CTT GAA GCC TTT ACA CAA TTC GGT TCT 3 34 

Asp Leu Ala Ser His Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser 
2B0 285 290 295 

GAT TTG GAT GCC GCA ACA CAA GCT AAA CTT AAT CGT GGA CGT CGT ACA 382 
Asp Leu Asp Ala Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr 
300 305 310 

GTT GAA GTG CTT AAA CAA CCA CTT CAT AAC CCA CTT CCG GTT GAA AAA 4 30 

Val Glu Val Leu Lys Gin Pro Leu His Asn Pro Leu Pro Val Glu Lys 
315 320 325 

CAA GTT CTT ATT CTT TAC GCT TTG ACA CAT GGC TTC TTG GAC AGT GTT 478 
Gin Val Leu He Leu Tyr Ala Leu Thr His Gly Phe Leu Asp Ser Val 
330 335 340 

CCG GTT GAT CAA ATC TTG GAT TTT GAA GAA GCC CTC TAT GAC TAC TTT 52 6 

Pro Val Asp Gin He Leu Asp Phe Glu Glu Ala Leu Tyr Asp Tyr Phe 
345 350 355 

GAT AGC CAT CAT GAG GAT ATC TTT GAA ACA ATC CGT TCA ACT AAG GAT 57 4 

Asp Ser His His Glu Asp He Phe Glu Thr He Arg Ser Thr Lys Asp 
360 365 370 375 

CTT CCT GAA GAA GCT GTG CTT AAT GAA GCT ATC CAA GCT TTC AAA GAT 622 
Leu Pro Glu Glu Ala Val Leu Asn Glu Ala He Gin Ala Phe Lys Asp 
380 385 390 

CAA TCG GAA TAC AAA TAGAGATAGG GAG GAC AG C A T ATG GCA GGC TCT CTA 67 3 

Gin Ser Glu Tyr Lys Met Ala Gly Ser Leu 

395 1 5 

AGA GAA ATC AAA GCA AAA ATT GCT TCA ATT AAG CAA ACG AGT CAT ATT 721 
Arg Glu He Lys Ala Lys He Ala Ser He Lys Gin Thr Ser His He 
10 15 20 

ACA GGA GCC ATG CAA ATG GTT TCT GCT TCT AAA TTG ACA CGT TCT GAG 7 69 

Thr Gly Ala Met Gin Met Val Ser Ala Ser Lys Leu Thr Arg Ser Glu 
25 30 35 

CAG GCT GCT AAA GAT TTC CAA ATC TAT GCC TCA AAA ATT AGA CAG ATC 817 
Gin Ala Ala Lys Asp Phe Gin He Tyr Ala Ser Lys He Arg Gin He 
40 45 50 

ACA ACA GAT CTT CTA CAT TCA GAA TTG GTT AAT GGT TCT TCA AAT CCG 8 65 

Thr Thr Asp Leu Leu His Ser Glu Leu Val Asn Gly Ser Ser Asn Pro 
55 60 65 

ATG TTG GAT GCA CGT CCA GTT CGT AAG TCA GGG TAT ATT GTC ATT ACT 913 
Met Leu Asp Ala Arg Pro Val Arg Lys Ser Gly Tyr He Val He Thr 
7 ° 75 80 85 
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TCA GAT AAG GGA TTA GTT GGA GGA TAT AAT TCA ACC ATT CTT AAA GCT 961 

Ser Asp Lys Gly Leu Val Gly Gly Tyr Asn Ser Thr lie Leu Lys Ala 
90 95 100 

GTC TTG GAT ATG ATT AAA CGT GAC CAT GAT TCT GAA GAT GAA TAT GCT 1009 

Val Leu Asp Met lie Lys Arg Asp His Asp Ser Glu Asp Glu Tyr Ala 

105 110 115 

ATC ATC TCT ATT GGT GGA ACA GGT TCA GAT TTC TTC AAA GCT CGT AAC 1057 

lie lie Ser He Gly Gly Thr Gly Ser Asp Phe Phe Lys Ala Arg Asn 

120 125 130 

ATG AAT GTT GCT TTT GAA CTT CGT GGC CTT GAA GAT CAA CCT AGT TTC 1105 

Met Asn Val Ala Phe Glu Leu Arg Gly Leu Glu Asp Gin Pro Ser Phe 
135 140 145 

GAT CAA GTC GGG GAA ATC ATT CTA AAA GCT GTA GGA ATG TAT CAA AAT 115 3 

Asp Gin Val Gly Glu He He Leu Lys Ala Val Gly Met Tyr Gin Asn 

150 155 160 165 

GAG CTT TTT GAT GAA CTT TAT GTG TGT TAC AAT CAT CAT ATT AAT AGT 12 01 

Glu Leu Phe Asp Glu Leu Tyr Val Cys Tyr Asn His His He Asn Ser 
170 175 180 

TTG TTT TGT GAA GTT TGT GTT GAA AAA ATG CTT CCA ATT GCT GAT TTT 12 4 9 

Leu Phe Cys Glu Val Cys Val Glu Lys Met Leu Pro He Ala Asp Phe 

185 190 195 

GAT CCT AAT GAA TTT GAA GGC CAT GTA TTG ACC AAG TTT GAA TTG GAA 12 97 

Asp Pro Asn Glu Phe Glu Gly His Val Leu Thr Lys Phe Glu Leu Glu 

200 205 210 

CCA AGT TGT GAT ACT ATT TTG GAT CAA CTT TTG CCC ACA ATA GTC GGT 13 45 

Pro Ser Cys Asp Thr lie Leu Asp Gin Leu Leu Pro Thr He Val Gly 
215 220 225 

GAG AGT TTT ATC TAC GGT GCT ATC GTA GAT GCC AAA ACA GCT GAG CAT 13 93 

Glu Ser Phe He Tyr Gly Ala lie Val Asp Ala Lys Thr Ala Glu His 

230 235 240 245 

GCT GCT GGT ATG ACC GCA ATG CAG ACT GCC ACT GAT AAT GCT AAG AAA 14 41 

Ala Ala Gly Met Thr Ala Met Gin Thr Ala Thr Asp Asn Ala Lys Lys 
250 255 260 

ATA ATT AAC GAT TTA ACA ATT CAA TAC AAC CGT GCA CGT CAA GCA GCC 14 89 

He He Asn Asp Leu Thr He Gin Tyr Asn Arg Ala Arg Gin Ala Ala 

265 270 275 

ATT ACT CAG GAA ATC ACT GAG ATT GTT GGC GGT GCT AGT GCA CTT GAA 15 37 

lie Thr Gin Glu He Thr Glu He Val Gly Gly Ala Ser Ala Leu Glu 

280 285 290 

TAGCTAGAGA TTTGTCTTGA TTTGACATAC AATAAAAAGG GATGATTGTC ATCCAGAAAA 15 97 

CTT CAT AAG G AGAAAACA ATG AGC TCA GGC AAA ATT GCT CAG GTT GTT GGT 164 8 

Met Ser Ser Gly Lys He Ala Gin Val Val Gly 

1 - 5 10 * 



CCT GTT GTA GAC GTA GCG TTT GCA ACT GGC GAT AAA CTT CCT GAG ATT 
Pro Val Val Asp Val Ala Phe Ala Thr Gly Asp Lys Leu Pro Glu He 
15 20 25 



1696 
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AAC AAT GCA TTG GTC GTT TAC ACT GAG AAG AAA AGT CTT AGA CGG ATG 17 4 4 

Asn Asn Ala Leu Val Val Tyr Thr Glu Lys Lys Ser Leu Arg Arg Met 
30 35 40 

GTG CTC GAA GTA GCT TCG TTG AAA CTT GGA GAA GGT GTG GTT CGT ACC 17 92 

Val Leu Glu Val Ala Ser Leu Lys Leu Gly Glu Gly Val Val Arg Thr 
, 45 50 55 

ATT GCC ATG GAA TCT ACT GAT GGA TTG ACT CGT GGG CTA GAA GTT CTG 184 0 

lie Ala Met Glu Ser Thr Asp Gly Leu Thr Arg Gly Leu Glu Val Leu 
60 65 70 75 

GAC ACA GGT CGT CCA ATC AGT GTT CCT GTT GGT AAA GAA CTT CTT GGA 18 86 

Asp Thr Gly Arg Pro lie Ser Val Pro Val Gly Lys Glu Leu Leu Gly 
80 85 90 

CGT GTC TTT AAC GTG CTT GGA GAT ACC ATT GAC ATG GAA GCA CCT TTT 1936 
Arg Val Phe Asn Val Leu Gly Asp Thr lie Asp Met Glu Ala Pro Phe 
95 100 105 

GCA GAT GAT GCA GAG CGT GAA CCA ATT CAT AAA AAA GCA CCT ACC TTC 198 4 

Ala Asp Asp Ala Glu Arg Glu Pro lie His Lys Lys Ala Pro Thr Phe 
110 115 120 

GAT GAA TTG TCA ACA AGT ACT GAA ATC CTT GAA ACA GGG ATT AAA GTT 2 0 32 

Asp Glu Leu Ser Thr Ser Thr Glu He Leu Glu Thr Gly lie Lys Val 
125 130 135 

ATC GAC TTG CTT GCC CCT TAT CTT AAA GGT GGT AAA GTC GGA CTT TTC 208 0 

He Asp Leu Leu Ala Pro Tyr Leu Lys Gly Gly Lys Val Gly Leu Phe 
140 145 150 155 

GGT GGT GCC GGT GTT GGT AAG GCC GTT CTT ATT CAA GAG CTG AAT CAC 212 8 

Gly Gly Ala Gly Val Gly Lys Ala Val Leu He Gin Glu Leu Asn His 
160 165 170 

AAC ATT GCT CAA GAA CAC GGT GGC ATT TCC GTG 2161 
Asn He Ala Gin Glu His Gly Gly He Ser Val 
175 180 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asp Ser His Leu His Ser Arg Leu Leu Glu Arg Ser Ala Lys Leu Ser 
15 10 15 

Asp Asp Leu Gly Gly Gly Ser Met Thr Ala Leu Pro He He Gin Thr 
20 25 30 

Gin Ala Gly Asp He Ser Ala Tyr. He Ala Thr Asn Val He Ser He 
35 40 45 

Thr Asp Gly Gin He Phe Leu Gin Glu Asn Leu Phe Asn Ser Gly He 
50 55 60 
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Arg Pro Ala lie Asp Ala Gly Ser Ser Val Ser Arg Val Gly Gly Ser 
65 70 75 80 

Ala Gin lie Lys Ala Met Lys Lys Val Ala Gly Thr Leu Arg Leu Asp 
85 90 95 

Leu Ala Ser His Arg Glu Leu Glu Ala Phe Thr Gin Phe Gly Ser Asp 
* 100 105 110 

Leu Asp Ala Ala Thr Gin Ala Lys Leu Asn Arg Gly Arg Arg Thr Val 
115 120 125 

Glu Val Leu Lys Gin Pro Leu His Asn Pro Leu Pro Val Glu Lys Gin 
130 * 135 140 

Val Leu lie Leu Tyr Ala Leu Thr His Gly Phe Leu Asp Ser Val Pro 
145 150 155 160 

Val Asp Gin lie Leu Asp Phe Glu Glu Ala Leu Tyr Asp Tyr Phe Asp 
165 170 175 

Ser His His Glu Asp lie Phe Glu Thr He Arg Ser Thr Lys Asp Leu 
180 185 190 

Pro Glu Glu Ala Val Leu Asn Glu Ala He Gin Ala Phe Lys Asp Gin 
195 200 205 



Ser Glu Tyr Lys 

210 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ala Gly Ser Leu Arg Glu He Lys Ala Lys He Ala Ser He Lys 
1 5 "10 15 

Gin Thr Ser His He Thr Gly Ala Met Gin Met Val Ser Ala Ser Lys 
20 25 30 

Leu Thr Arg Ser Glu Gin Ala Ala Lys Asp Phe Gin He Tyr Ala Ser 
35 40 45 

Lys He Arg Gin He Thr Thr Asp Leu Leu His Ser Glu Leu Val Asn 
50 55 60 

Gly Ser Ser Asn Pro Met Leu Asp Ala Arg Pro Val Arg Lys Ser Gly 
65 70 75 80 

Tyr He Val He Thr Ser Asp Lys Gly Leu Val Gly Gly Tyr Asn Ser 
85 90 95 

Thr He Leu Lys Ala Val Leu Asp Met He Lys Arg Asp His Asp Ser 
100 105 110 
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Glu Asp Glu Tyr Ala lie lie Ser lie Gly Gly Thr Gly Ser Asp Phe 
115 120 125 

Phe Lys Ala Arg Asn Met Asn Val Ala Phe Glu Leu Arg Gly Leu Glu 
130 135 140 

Asp Gin Pro Ser Phe Asp Gin Val Gly Glu lie lie Leu Lys Ala Val 
1.4 5 150 155 160 

Gly Met Tyr Gin Asn Glu Leu Phe Asp Glu Leu Tyr Val Cys Tyr Asn 
165 170 175 

His His lie Asn Ser Leu Phe Cys Glu Val Cys Val Glu Lys Met Leu 
180 185 190 

Pro lie Ala Asp Phe Asp Pro Asn Glu Phe Glu Gly His Val Leu Thr 
195 200 205 

Lys Phe Glu Leu Glu Pro Ser Cys Asp Thr lie Leu Asp Gin Leu Leu 
210 215 220 

Pro Thr lie Val Gly Glu Ser Phe He Tyr Gly Ala He Val Asp Ala 
225 230 235 240 

Lys Thr Ala Glu His Ala Ala Gly Met Thr Ala Met Gin Thr Ala Thr 
245 250 255 

Asp Asn Ala Lys Lys He lie Asn Asp Leu Thr lie Gin Tyr Asn Arg 
260 265 270 

Ala Arg Gin Ala Ala He Thr Gin Glu He Thr Glu lie Val Gly Gly 
275 280 265 

Ala Ser Ala Leu Glu 
290 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Ser Ser Gly Lys He Ala Gin Val Val Gly Pro Val Val Asp Val 
1 5 10 15 

Ala Phe Ala Thr Gly Asp Lys Leu Pro Glu He Asn Asn Ala Leu Val 
20 25 30 

Val Tyr Thr Glu Lys Lys Ser Leu Arg Arg Met Val Leu Glu Val Ala 
35 .40 45 

Ser Leu Lys Leu Gly Glu Gly Val Val Arg Thr He Ala Met Glu Ser 

50 ... 55 • 60 

Thr Asp Gly Leu Thr Arg Gly Leu Glu Val Leu Asp Thr Gly Arg Pro 
65 70 75 80 
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lie Ser Val Pro Val Gly Lys Glu Leu Leu Gly Arg Val Phe Asn Val 
85 90 95 

Leu Gly Asp Thr lie Asp Met Glu Ala Pro Phe Ala Asp Asp Ala Glu 
100 105 110 

Arg Glu Pro lie His Lys Lys Ala Pro Thr Phe Asp Glu Leu Ser Thr 
* 115 120 125 

Ser Thr Glu lie Leu Glu Thr Gly lie Lys Val lie Asp Leu Leu Ala 
130 135 140 

Pro Tyr Leu Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val 
145 150 155 160 - 

Gly Lys Ala Val Leu He Gin Glu Leu Asn His Asn He Ala Gin Glu 
165 170 175- 

His Gly Gly He Ser Val 
180 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(Ai LENGTH: 914 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

{iv> ANTI-SENSE: NO 

(v) FRAGMENT TYPE: C- terminal 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Phaffia rhodozyma 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 51 .. 82 4 

(D) OTHER INFORMATION: /partial 
/codon_start- 51 

/product= "ATPase subunit, partial sequence" 
/gene= "ATP2 " 

/s tandard_name= "beta subunit of the Fl oortion of 
the F0F1 ATPase" 
/label= beta-subunit 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GAATTCTCAA CCTTGAGGGT GACTCCAAGG TCGCTCTTGT CTTCGGACAG ATG AAC 56 

Met Asn 

GAG CCC CCG GGT GCT CGA GCC CGA GTC GCT TTG ACT GGT TTG ACC ATC 104 
Glu Pro Pro Gly Ala Arg Ala Arg Val Ala Leu Thr Gly Leu Thr He 
185 190 195 200 
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GCC GAG TAC TTC CGA GAC GAG GAA GGA CAG GAT GTC TTG CTT TTC ATC 152 

Ala Glu Tyr Phe Arg Asp Glu Glu Gly Gin Asp Val Leu Leu Phe lie 
205 210 . 215 

GAC AAC ATT TTC CGA TTC ACC CAG GCC GGT TCT GAG GTG TCT GCC TTG 200 

Asp Asn lie Phe Arg Phe Thr Gin Ala Gly Ser Glu Val Ser Ala Leu 
, 220 225 230 

CTT GGT CGA ATT CCC TCC GCC GTC GGA TAC CAG CCC ACT CTT TCC ACC 24 8 

Leu Gly Arg lie Pro Ser Ala Val Gly Tyr Gin Pro Thr Leu Ser Thr 
235 240 245 

GAT ATG GGA GGT ATG CAG GAG CGA ATT ACC ACC ACC AAG AAG GGA TCC 2 96 

Asp Met Gly Gly Met Gin Glu Arg lie Thr Thr Thr Lys Lys Gly Ser 
250 255 260 

ATC ACT TCC GTC CAG GCC GTC TAC GTG CCT GCT GAT GAT TTG ACC GAT 34 4 

lie Thr Ser Val Gin Ala Val Tyr Val Pro Ala Asp Asp Leu Thr Asp 

265 270 275 280 

CCT GCC CCC GCC ACC ACC TTC GCC CAC TTG GAC GCC ACC ACT GTG TTG 3 92 

Pro Ala Pro Ala Thr Thr Phe Ala His Leu Asp Ala Thr Thr Val Leu 
285 290 295 

TCT CGA GGT ATC GCT GAG TTG GGT ATC TAC CCC GCT GTC GAT CCC CTT 4 40 

Ser Arg Gly lie Ala Glu Leu Gly lie Tyr Pro Ala Val Asp Pro Leu 
300 305 310 

GAT TCT AAG TCC CGA ATG CTC GAC CCC CGA ATT GTC GGA CAG GAG CAC 4 88 

Asp Ser Lys Ser Arg Met Leu Asp Pro' Arg lie Val Gly Gin Glu His 
315 320 325 

TAC GAC ATC GCC ACC AAG ACC CAG AAG ATC CTC CAG GAC TAC AAG TCC 536 

Tyr Asp lie Ala Thr Lys Thr Gin Lys lie Leu Gin Asp Tyr Lys Ser 
330 335 340 

CTC CAG GAT ATC ATT GCC ATT CTT GGT ATG GAT GAG TTG TCT GAG GAG 58 4 

Leu Gin Asp lie lie Ala lie Leu Gly Met Asp Glu Leu Ser Glu Glu 

345 350 355 360 

GAC AAG TTG ACC- GTC GAG CGA GCC CGA AAG ATC CAG CGA TTC ATG TCG 632 

Asp Lys Leu Thr Val Glu Arg Ala Arg Lys lie Gin Arg Phe Met Ser 
365 370 375 

CAG CCT TTC GCT GTC GCT CAG GTC TTC ACT GGT ATC GAG GGA AAG CTT 63 0 

Gin Pro Phe Ala Val Ala Gin Val Phe Thr Gly lie Glu Gly Lys Leu 
380 385 390 

GTT CCC TTG AAG ACT ACT TTG GAG TCC TTT AAG GAG CTT CTT TCC GGA 72 8 

Val Pro Leu Lys Thr Thr Leu Glu Ser Phe Lys Glu Leu Leu Ser Gly 
395 400 405 

GCC TGC GAC CAC CTC CCT GAG TCT GCT TTC TAC ATG GTT GGT GAC ATC 77 6 

Ala Cys Asp His Leu Pro Glu Ser Ala Phe Tyr Met Val Gly Asp lie 
410 415 420 

GCT GAT GTC AAG GCC AAG GCT GCT GCC CAG GCT AAG GAG TTG GCT GCT 8 24 

Ala Asp Val Lys Ala Lys Ala Ala Ala Gin Ala Lys Glu Leu Ala Ala 

425 430 * • 435 440 

TAAGAGAAGA GTTGTCGAAT GTGTTTCGAG GTGTCAGAGT TGTCTTTTAT GAATGTTTCT 8 84 



ATCTCCTTAA AAAAAAAAAA AAAAAAAAAA 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

* (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Asn Glu Pro Pro Gly Ala Arg Ala Arg Val Ala Leu Thr Gly Leu 
15 10 15 

Thr lie Ala Glu Tyr Phe Arg Asp Glu Glu Gly Gin Asp Val Leu Leu 
20 25 * 30 

Phe lie Asp Asn lie Phe Arg Phe Thr Gin Ala Gly Ser Glu Val Ser 
35 40 45 

Ala- Leu Leu Gly Arg lie Pro Ser Ala Val Gly Tyr Gin Pro Thr Leu 
50 55 60 

Ser Thr Asp Met Gly Gly Met Gin Glu Arg lie Thr Thr Thr Lys Lys 
65 70 75 80 

Gly Ser lie Thr Ser Val Gin Ala Val Tyr Val Pro Ala Asp Asp Leu 
85 90 95 

Thr Asp Pro Ala Pro Ala Thr Thr Phe Ala His Leu Asp Ala Thr Thr 
100 105 110 

Val Leu Ser Arg Gly lie Ala Glu Leu Gly lie Tyr Pro Ala Val Asp 
115 120 125 

Pro Leu Asp Ser Lys Ser Arg Met Leu Asp Pro Arg lie Val Gly Gin 
130 135 140 

Glu His Tyr Asp lie Ala Thr Lys Thr Gin Lys lie Leu Gin Asp Tyr 
145 150 155 160 

Lys Ser Leu Gin Asp lie lie Ala lie Leu Gly Met Asp Glu Leu Ser 
165 170 175 

Glu Glu Asp Lys Leu Thr Val Glu Arg Ala Arg Lys lie Gin Arg Phe 
180 185 190 

Met Ser Gin Pro Phe Ala Val Ala Gin Val Phe Thr Gly lie Glu Gly 
195 200 205 

Lys Leu Val Pro Leu Lys Thr Thr Leu Glu Ser Phe Lys Glu Leu Leu 
210 215 220 

Ser Gly Ala Cys Asp His Leu Pro Glu Ser Ala Phe Tyr Met Val Gly 
225 230 235 240 

Asp He Ala Asp Val Lys Ala Lys Ala Ala Ala Gin Ala Lys Glu Leu 
245 250 . 255 

Ala Ala 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichoderma reesei 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 50. .361 

(D) OTHER INFORMATION: /partial 
/codon_start= 50 

/product= "ATPase subunit, partial sequence" 
/gene= "ATP 2 " 

/standard_name= "beta subunit of Fl portion of the 
F0F1 ATPase" 
/label= beta-subunit 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TACTCGAAGA ATTCGGCACG AGGCTGATTG CTCTCGGTCA TCTGCCAAG ATG TTC 55 

Met Phe 
260 

AAG AGC GGC GTT TCG TCC CTC GCC AGG GCT GCC CGC CCA TCA ATT ACC 103 
Lys Ser Giy Val Ser Ser Leu Ala Arg Ala Ala Arg Pro Ser lie Thr 
265 270 275 

GCT CGA CGA GCT ATC CGA CCA GCC TTC CCT CGA ACC CCC CTC GCG AGG 151 
Ala Arg Arg Ala lie Arg Pro Ala Phe Pro Arg Thr Pro Leu Ala Arg 
280 285 290 

CTT GCC AGC ACC CAG AGC GTC GGA GAT GGC AAG ATC CAC CAG GTC ATT 199 
Leu Ala Ser Thr Gin Ser Val Gly Asp Gly Lys lie His Gin Vai lie 
295 300 305 

GGT GCC GTC GTC GAC GTC AAG TTC GAC ACC GCC AAG CTG CCT CCT ATC 247 
Gly Ala Val Val Asp Val Lys Phe Asp Thr Ala Lys Leu Pro Pro lie 
310 315 320 

CTG AAC GCC CTG GAG ACC ACC AAC AAC AAC CAG AAG CTG GTC CTC GAG 295 
Leu Asn Ala Leu Glu Thr Thr Asn Asn Asn Gin Lys Leu Val Leu Glu 
325 330 335 340 

GTG GCT CAA CAC TTG GGC GAG AAT GTC GTT CGC TGC ATT GCC ATG GAC 343 
Val Ala Gin His Leu Gly Glu Asn Val Val Arg Cys lie Ala Met Asp 
345 350 355 

GGA TCC GAG GGT CTC GTC GTGGTTCCAA GGCA 37 5 

Gly Ser Glu Gly Leu Val 
360 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS; 
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(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Phe Lys Ser Gly Val Ser Ser Leu Ala Arg Ala Ala Arg Pro Set 
1 5 10 15 

lie Thr Ala Arg Arg Ala lie Arg Pro Ala Phe Pro Arg Thr Pro Leu 
20 25 30 

Ala Arg Leu Ala Ser Thr Gin Ser Val Gly Asp Gly Lys lie His Gin 
35 40 45 

Val lie Gly Ala Val Val Asp Val Lys Phe Asp Thr Ala Lys Leu Pro 
50 55 60 

Pro lie Leu Asn Ala Leu Glu Thr Thr Asn Asn Asn Gin Lys Leu Val 
65 70 75 80 

Leu Glu Val Ala Gin His Leu Gly Glu Asn Val Val Arg Cys lie Ala 
85 90 95 



Met Asp Gly Ser Glu Gly Leu Val 
100 
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PATENT CLAIMS 

1. A method of improving the production of biomass or a 
desired product from a cell, characterized by expressing 

5 an uncoupled ATPase activity in said cell to induce con- 
version of ATP to ADP without primary effects on other 
cellular metabolites or functions, and incubating the 
cell with a suitable substrate to produce said biomass or 
product . 

10 

2. A method according to claim 1, characterized by ex- 
pressing in said cell the soluble part (Fi) of the mem- 
brane bound (FqFi type) H + -ATPase or a portion of Fi ex- 
hibiting ATPase activity. 

15 

3. A method according to claim 1 or 2 , wherein said cell 
is a prokaryotic cell. 

4. A method according to claim 3, wherein said cell is 
20 selected from the group consisting of bacteria belonging 

to the genera Lactococcus, Streptococcus, Enterococcus , 
Lactobacillus, Leuconostoc, Escherichia, Zymomonas , Ba- 
cillus and Pseudomonas. 

25 5 . A method according to claim 1 or 2 , wherein said cell 
is a eukaryotic cell. 

6. A method according to claim 5, wherein said cell is a 
yeast cell. 

30 

7. A method according to claim 6, wherein said cell be- 
longs to Saccharomyces cerevisiae or Trichoderma reesei. 

8. A method according to any one of claims 1-7, wherein 
35 said cell is transformed or transfected with an expres- 
sion vector including DNA encoding Fi or a portion 
thereof exhibiting ATPase activity under the control of a 
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promoter functioning in said cell, and said DMA is ex- 
pressed in the cell. 

9. A method according to claim 8, wherein said DNA en- 
5 'coding Fi or a portion thereof is homologous to said 

cell. 

10. A method according to claim 8, wherein said DNA en- 
coding Fi or a portion thereof is heterologous to said 

10 cell. 

11. A method according to any one of claims 8-10, 
wherein said DNA encoding Fi or a portion thereof is de- 
rived from a prokaryotic organism. 

15 

12. A method according to claim 11, wherein said DNA en- 
coding Fi or a portion thereof is derived from Esche- 
richia coli f Lactococcus lactis or Streptococcus thermd- 
philus and is selected from the group consisting of the 

20 gene encoding the Fi subunit p or a portion thereof and 
various combinations of said gene or portion with the 
genes encoding the Fi subunits 5, a, y and e or portions 
thereof • 

25 13. A method according to claim 12, wherein said DNA en- 
coding Fi or a portion thereof is selected from the group 
consisting of the Escherichia coli. Streptococcus thermo- 
philus and Lactococcus lactis genes a tpHAGDC (coding for 
subunits 5, a, y, p, e) , atpAGDC (coding for subunits a, 

30 it Pr e), atpAGD (coding for subunits a, y, p), atpDC 
(coding for subunits p, e) and atpD (coding for subunit p 
alone) . 

14. A method according to any one of claims 8-10, 
35 wherein said DNA encoding Fi or a portion thereof is de- 
rived from a eukaryotic organism. 
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15. A method according to claim 14, wherein said DNA en- 
coding Fi or a portion thereof is derived from Saccharo- 
myces cerevisiae, Phaffia rhodozyma or Trichoderma reesei 
and is selected from the group consisting of the gene en- 

5 coding the Fi subunit p or a portion thereof and various 
combinations of said gene or portion with the genes en- 
coding the other Fi subunits or portions thereof. 

16. A vector including DNA encoding the soluble part 
10 (Fi) of the membrane bound (FqFi type) H + -ATPase or a 

portion of Fi exhibiting ATPase activity, said DNA being 
■- derived from Lactococcus lactis subsp. cremoris and hav- 
ing the sequence- stated in SEQ ID No. 1. 

15 17. A vector including DNA encoding the soluble part 
(Fi) of the membrane bound (FqFi type) H + -ATPase or a 
portion of Fi exhibiting ATPase activity, said DNA being 
derived from Lactococcus lactis subsp. lactis and having 
the sequence stated in SEQ ID No. 6. 

20 

18. A vector including DNA encoding the soluble part 
(Fi) of the membrane bound (FqFi type) H + -ATPase or a 
portion of Fi exhibiting ATPase activity, said DNA being 
derived from Streptococcus thermophilus and having the 

25 sequence stated in SEQ ID No. 10. 

19. A vector including DNA encoding the soluble part 
(Fi) of the membrane bound (FqFi type) H + -ATPase or a 
portion of Fi exhibiting ATPase activity, said DNA being 

30 derived from Phaffia rhodozyma and having the sequence 
stated in SEQ ID No. 14. 

20. A vector including DNA encoding the soluble part 
(Fi) of the membrane bound (FoFi type) H + -ATPase or a 

35 portion of Fi exhibiting ATPase activity, said DNA being 
derived from Trichoderma reesei and having the sequence 
stated in SEQ ID No. 16. 
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21. An expression vector including DNA as defined in any 
one of claims 16-20 under the control of a promoter capa- 
ble of directing the expression of said DNA in a prokary- 
otic or eukaryotic cell. 

5 ' 

22. A method of optimizing the formation of biomass or a 
desired product by a cell, characterized by expressing 
different levels of uncoupled ATPase activity in the 
cell, incubating the cell on a suitable substrate, meas- 

10 uring the conversion rate of substrate into biomass or 
the desired product at each level of ATPase expression, 
and choosing a level of ATPase expression at which the 
conversion rate is optimized. 

15 23. A method according to claim 22, wherein a number of 
specimens of said cell are transformed or transfected 
with their respective expression vector each including 
DNA encoding a different portion of the cytoplasmic part 
(Fi) of the membrane bound (FoFi type) H + -ATPase up to 

20 and including the entire Fi, each portion exhibiting ATP- 
ase activity, said DNA in each expression vector being 
under the control of a promoter functioning in said cell, 
incubating each cell specimen on a suitable substrate , 
measuring the conversion rate of substrate into biomass 

25 or the desired product by each specimen, and choosing a 
specimen yielding an optimized conversion rate. 

24. A method according to claim 22, wherein a number of 
specimens of said cell are transformed or transfected 

30 with their respective expression vector including DNA en- 
coding a portion of the cytoplasmic part (Fi) of the mem- 
brane bound (FoFi type) H + -ATPase up to and including the 
entire F\, said portion exhibiting ATPase activity, said 
DNA in the respective expression vectors being under the 

35 control of each of a series of promoters covering a broad 
range of promoter activities and functioning in said 
cell, incubating each cell specimen on a suitable sub- 
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strate, measuring the conversion rate of substrate into 
biomass or the desired product by each specimen, and 
choosing a specimen yielding an optimized conversion 
rate, 

5 

25. A method according to claim 24 , wherein the respec- 
tive expression vectors include DNA encoding different 
such portions of Fi up to and including the entire Fi, 
each DNA in respective expression vectors being under the 

10 control of each of a series of promoters covering a broad 
range of promoter activities and functioning in said 
cell. 

26. A method according to any one of claims 23-25, 
15 wherein the promoter in each expression vector is an in- 
ducible promoter, and each cell specimen is grown at dif- 
ferent concentrations of inducer. 

27. A method according to any one of claims 23-26, 
20 wherein said DNA encoding a portion of Fi up to and in- 
cluding the entire Fi is homologous to said cell. 

28. A method according to any one of claims 23-26 , 
wherein said DNA encoding a portion of Fi up to and in- 

25 eluding the entire Fi is heterologous to said cell. 

29 . A method according to any one of claims 23-28 , 
wherein said DNA encoding a portion of Fi up to and in- 
cluding the entire Fi is derived from a prokaryotic or- 

30 ganism. 

30. A method according to claim 29, wherein said DNA en- 
coding a portion of Fi up to and including the entire Fi 
is derived from Escherichia coli, Lactococcus lactis or 

35 Streptococcus thermophilus and is selected from the group 
consisting of the gene encoding the Fi subunit p or a 
portion thereof and various combinations of said gene or 
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portion with the genes encoding the Fi subunits 5, a, y 
and 8 or portions thereof. 

31. A method according to claim 30, wherein said DNA en- 
5 * coding a portion of Fi up to and including the entire Fi 

is selected from the group consisting of the E. coli 
genes atpAGDC (coding for subunits a, y, p, e) , atpAGD 
(coding for subunits a, y, p), atpDC (coding for subunits 
P, e) and atpD (coding for subunit P alone), 

10 

32. A method according to any one of claims 23-28, 
wherein said DNA encoding a portion of Fi up to and in- 
cluding the entire Fi is derived from a eukaryotic organ- 
ism. 

15 

33. A method according to claim 32, wherein said DNA. en- 
coding Fi or a portion thereof is derived from Saccharo- 
myces cerevisiae , Phaffia rhodozyma or Trichoderma reesei 
and is selected from the group consisting of the gene en- 

20 coding the Fi subunit P or a portion thereof and various 
combinations of said gene or portion with the genes r en- 
coding the other Fx subunits or portions thereof. 
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